CCCGTGCCCC TAAAGGCCGC CGAGAAAGCT AAGTCCAAAT GTGACGTCGG AGGTCTCGAC 60

ATGGTCGCCA ACCCTCCAAA TGCTACCCGC CGGCCCACGC AACGCGGGCT TTTATAAAGA 120

TGGCGCGCGA GACAATAACA CTTACTCATC CGCGTACGCG TTTATTATTG TCAATATTTG 180

TGTGGTTATT ATTACTGCTA CCGCCCTTGT TTCTGCAAGG CCCTCGCCGC GGCCCAGGCC 240

ACTATTCCGG CAGCGGCCGC CGACGCGGCG AGCGTCGCCG CTAACGTCGG CGCCGCGGGG 300

AGCGGGGTTT CTTCGACTTA AATAGACTCC CGAGAAAAAA TTTTGGCTGC CGTTCGCCAT 360

CATCCGAGTC GGAAACACAG TATGCGGCCG AGTTAGGTTT TACTTTTAAA AACTTTACCG 420

TGCTGTACGG CCAGGGCGTT CTCAGGCTCG AAGGGGCAAG AGTTGTCCAG ACTGATGGGT 480

GACTCAGAGA CAGCGTTGTC TTGTCTCCGT TTACCAAAAA TATTTCCACT CCTCTCTCAA 540

AATTTTTACC TCCGGTTTCG GTAATTAGGA AAGTTTTTGG CGCAGGGAGG TTTAAAGCTG 600

CCATGCATAT GTCAGCGGTA CCCAGCACCC ACAAATGGAA CTCTTTTGCG GCATACGCGC 660

CAGATGACAA ATGGTAAAAC CCTGCGTCCA AGCCGCTCCA CTCGGGACTT ACTCCAGGCG 720

GGTCGCCCCC CTCACCGAAC CGAATCACGG GTCTGCACAT CCTGGGAAGG GAAAACAGCT 780

CCCCGGAAAC TTCGTACAGA GATGCCGGGC GCACGATTAC CGATAATGTA CTCGGACGAT 840

CGTAACTCGC CATAGTTTTC ACTGCGTGAA CCAATTCTTT CCATCCAGAA TCCGAGAGCT 900

CAAATCTAGA ATTAGGTAGT TTGTAGTGCG AATCGACCGC AGAAACTATA GTCACTTTTA 960

CAGGCGCCAT CGCCGCTCAG ACTCCACCCC GCTATGATGT CAGAAATATA ACGCTCTTAT 1020

TCTAGCAGAG TCAGGCCAAT ATATACAGCT TAGAGAAG ATG CGG TTT CGG CGC 1073
Met Arg Phe Arg Arg 
1 5 

ATC TGT TCA CGC TCT AGG GCA GAA AAA CGA AGA AGA ACA ACC GAG AAT 1121 
Ile Cys Ser Arg Ser Arg Ala Glu Lys Arg Arg Arg Thr Thr Glu Asn
10 15 20 



CCG CTT ACC TCA AAA CGC GTT TGC GTA TTG GAT AGT TTC TCA CGG ACA 
Pro Leu Thr Ser Lys Arg Val Cys Val Leu Asp Ser Phe Ser Arg Thr 
25 30 35 

ATG TCA TTG CGC CCC TAT GCA GAA ATT TTG CCG ACC GCG GAA GGC GTC 
Met Ser Leu Arg Pro Tyr Ala Glu Ile Leu Pro Thr Ala Glu Gly Val
40 45 50 

GAG CGC CTC GCC GAA CTT GTT AGT GTG ACA ATG ACA GAA CGC GCG GAA 
Glu Arg Leu Ala Glu Leu Val Ser Val Thr Met Thr Glu Arg Ala Glu 
55 60 65 

CCT GTG ACA GAG AAT ACA GCT GTA AAC AGT ATC CCC CCG GCT AAC GAG 
Pro Val Thr Glu Asn Thr Ala Val Asn Ser Ile Pro Pro Ala Asn Glu
70 75 80 85 

AAC GGG CAG AAC TTC GCA TAT GCA GGC GAT GGG CCC TCG ACT ACT GAA 
Asn Gly Gln Asn Phe Ala Tyr Ala Gly Asp Gly Pro Ser Thr Thr Glu
90 95 100 

AAA GTT GAC GGC TCG CAT ACA GAC TTC GAT GAA GCA TCG AGC GAC TAC 
Lys Val Asp Gly Ser His Thr Asp Phe Asp Glu Ala Ser Ser Asp Tyr 
105 110 115

GCC GGC CCT GTC CCG CTC GCG CAA ACT AGA TTG AAG CAT TCG GAT GAA 
Ala Gly Pro Val Pro Leu Ala Gln Thr Arg Leu Lys His Ser Asp Glu
120 125 130 

TTT CTT CAG CAC TTC CGA GTT TTA GAC GAT TTG GTG GAG GGG GCT TAC 
Phe Leu Gln His Phe Arg Val Leu Asp Asp Leu Val Glu Gly Ala Tyr
135 140 145 

GGG TTT ATC TGC GGC GTC CGT CGC TAC ACC GAG GAA GAG CAA CGT CGA 
Gly Phe Ile Cys Gly Val Arg Arg Tyr Thr Glu Glu Glu Gln Arg Arg
150 155 160 165

AGA GGG GTT AAC AGT ACT AAC CAG GGG AAA TCA AAA TGT AAG CGC CTG 
Arg Gly Val Asn Ser Thr Asn Gln Gly Lys Ser Lys Cys Lys Arg Leu
170 175 180 

ATA GCT AAA TAT GTG AAA AAT GGA ACA AGG GCG GCC TCT CAG CTG GAA 
Ile Ala Lys Tyr Val Lys Asn Gly Thr Arg Ala Ala Ser Gln Leu Glu
185 190 195 

AAT GAA ATT TTG GTT CTC GGG CGC CTA AAT CAC GAG AAT GTT CTC AAG 
Asn Glu Ile Leu Val Leu Gly Arg Leu Asn His Glu Asn Val Leu Lys
200 205 210 

ATC CAG GAA ATC CTT CGG TAC CCG GAT AAT ACG TAC ATG TTA ACG CAG 
Ile Gln Glu Ile Leu Arg Tyr Pro Asp Asn Thr Tyr Met Leu Thr Gln
215 220 225 

AGG TAT CAG TTC GAC TTG TAC AGC TAC ATG TAC GAT GAA GCG TTC GAC 
Arg Tyr Gln Phe Asp Leu Tyr Ser Tyr Met Tyr Asp Glu Ala Phe Asp
230 235 240 245 

TGG AAA GAC AGT CCA ATG CTT AAA CAG ACT AGA CGC ATC ATG AAG CAG 
Trp Lys Asp Ser Pro Met Leu Lys Gln Thr Arg Arg Ile Met Lys Gln
250 255 260 

CTC ATG TCA GCG GTC TCG TAT ATC CAT TCA AAG AAA CTG ATT CAC AGG 
Leu Met Ser Ala Val Ser Tyr Ile His Ser Lys Lys Leu Ile His Arg
265 270 275 

GAC ATC AAA CTC GAA AAT ATT TTC TTA AAC TGC GAC GGC AAG ACA GTG 
Asp Ile Lys Leu Glu Asn Ile Phe Leu Asn Cys Asp Gly Lys Thr Val
280 285 290 



CTG GGC GAC TTT GGA ACT GTC ACG CCT TTT GAA AAT GAG CGG GAG CCC 1985
Leu Gly Asp Phe Gly Thr Val Thr Pro Phe Glu Asn Glu Arg Glu Pro
295 300 305 

TTC GAA TAT GGA TGG GTG GGG ACC GTG GCT ACT AAC TCT CCC GAG ATA 2033
Phe Glu Tyr Gly Trp Val Gly Thr Val Ala Thr Asn Ser Pro Glu Ile
310 315 320 325 

CTC GCC AGG GAT TCG TAC TGT GAA ATT ACA GAC ATT TGG AGC TGC GGA 2081 
Leu Ala Arg Asp Ser Tyr Cys Glu Ile Thr Asp Ile Trp Ser Cys Gly
330 335 340 

GTA GTA TTG CTG GAA ATG GTA AGC CAT GAA TTT TGC CCG ATC GGC GAT 2129
Val Val Leu Leu Glu Met Val Ser His Glu Phe Cys Pro Ile Gly Asp
345 350 355 

GGC GGG GGA AAT CCG CAC CAG CAA TTG CTG AAA GTT ATC GAC TCT CTC 2177 
Gly Gly Gly Asn Pro His Gln Gln Leu Leu Lys Val Ile Asp Ser Leu
360 365 370 

TCA GTT TGT GAT GAA GAG TTC CCA GAC CCC CCG TGT AAT CTG TAC AAT 2225 
Ser Val Cys Asp Glu Glu Phe Pro Asp Pro Pro Cys Asn Leu Tyr Asn 
375 380 385 

TAT TTG CAT TAT GCG AGC ATC GAT CGC GCC GGA CAT ACG GTC CCG TCG 2273
Tyr Leu His Tyr Ala Ser Ile Asp Arg Ala Gly His Thr Val Pro Ser
390 395 400 405 

CTC ATA CGG AAC CTC CAC CTT CCG GCG GAT GTG GAA TAC CCT CTA GTT 2321 
Leu Ile Arg Asn Leu His Leu Pro Ala Asp Val Glu Tyr Pro Leu Val
410 415 420 

AAA ATG CTT ACT TTT GAC TGG CGT TTG AGA CCC AGC GCG GCC GAA GTA 2369 
Lys Met Leu Thr Phe Asp Trp Arg Leu Arg Pro Ser Ala Ala Glu Val 
425 430 435 

TTG GCA ATG CCA CTG TTT TCG GCT GAA GAG GAA CGG ACC ATA ACA ATT 2417 
Leu Ala Met Pro Leu Phe Ser Ala Glu Glu Glu Arg Thr Ile Thr Ile
440 445 450 

ATT CAT GGA AAA CAT AAA CCC ATC CGA CCC GAA ATC CGT GCG CGG GTG 2465 
Ile His Gly Lys His Lys Pro Ile Arg Pro Glu Ile Arg Ala Arg Val
455 460 465 

CCA CGG TCC ATG AGT GAA GGT TAATAATAAA GGACGGAGAT AGAGAACTGA 2516 
Pro Arg Ser Met Ser Glu Gly 
470 475 

AGCGTCAGAT TTTTTTAAAA AAATAAATGA TCGAGAACTT ATGATTTGTC TTTCTTGA 2574 

ATG ACC TTG CCC CAT CGA TTA ACG AAA AGA CCT TTC GCG CGT CGA TTC 2622 
Met Thr Leu Pro His Arg Leu Thr Lys Arg Pro Phe Ala Arg Arg Phe 

1 5 10 15

TGC TCG GTC TTT GTG ATA CAT TAT AGT GAG ACT AAA CTC GAC CGA TAT 2670 
Cys Ser Val Phe Val Ile His Tyr Ser Glu Thr Lys Leu Asp Arg Tyr
20 25 30 

AAC AAG ACA ATG TTA CTC TAT AGA CCG GAC TCA ACC ATG CGG CAT AGC 2718 
Asn Lys Thr Met Leu Leu Tyr Arg Pro Asp Ser Thr Met Arg His Ser 
35 40 45 

GGA GGC GAC GCA AAT CAC AGA GGG ATA AGG CCG AGG CGG AAA TCT ATT 2766 
Gly Gly Asp Ala Asn His Arg Gly Ile Arg Pro Arg Arg Lys Ser Ile



GGA GCG TTT AGC GCG CGC GAA AAG ACT GGA AAA CGA AAT GCG CTG ACG 
Gly Ala Phe Ser Ala Arg Glu Lys Thr Gly Lys Arg Asn Ala Leu Thr 
65 70 75 80 

GAA AGC AGC TCC TCC TCC GAC ATG CTA GAT CCG TTT TCC ACG GAT AAG 
Glu Ser Ser Ser Ser Ser Asp Met Leu Asp Pro Phe Ser Thr Asp Lys 
85 90 95 

GAA TTT GGC GGT AAG TGG ACG GTA GAC GGA CCT GCC GAC ATT ACT GCC 
Glu Phe Gly Gly Lys Trp Thr Val Asp Gly Pro Ala Asp Ile Thr Ala
100 105 110 

GAG GTC CTT TCT CAG GCA TGG GAC GTT CTC CAA TTA GTG AAG CAT GAA 
Glu Val Leu Ser Gln Ala Trp Asp Val Leu Gln Leu Val Lys His Glu
115 120 125 

GAT GCG GAG GAG GAG AGA GTG ACT TAT GAG TCC AAA CCG ACC CCG ATA 
Asp Ala Glu Glu Glu Arg Val Thr Tyr Glu Ser Lys Pro Thr Pro Ile
130 135 140 

CAG CCG TTC AAT GCC TGG CCG GAC GGG CCG AGT TGG AAC GCG CAG GAT 
Gln Pro Phe Asn Ala Trp Pro Asp Gly Pro Ser Trp Asn Ala Gln Asp
145 150 155 160 

TTT ACT CGA GCG CCA ATA GTT TAT CCC TCT GCG GAG GTA TTG GAC GCA 
Phe Thr Arg Ala Pro Ile Val Tyr Pro Ser Ala Glu Val Leu Asp Ala
165 170 175 

GAG GCG TTG AAA GTA GGG GCA TTC GTT AGC CGA GTT TTA CAA TGT GTA 
Glu Ala Leu Lys Val Gly Ala Phe Val Ser Arg Val Leu Gln Cys Val
180 185 190 

CCG TTC ACG CGA TCA AAG AAA AGC GTT ACG GTG CGG GAT GCG CAG TCG 
Pro Phe Thr Arg Ser Lys Lys Ser Val Thr Val Arg Asp Ala Gln Ser
195 200 205 

TTT TTG GGG GAC TCG TTC TGG AGA ATA ATG CAG AAC GTT TAC ACG GTT 
Phe Leu Gly Asp Ser Phe Trp Arg Ile Met Gln Asn Val Tyr Thr Val
210 215 220 

TGC TTA CGA CAG CAC ATA ACT CGA CTC AGG CAC CCT TCC AGC AAA AGC 
Cys Leu Arg Gln His Ile Thr Arg Leu Arg His Pro Ser Ser Lys Ser
225 230 235 240 

ATT GTT AAC TGC AAC GAC CCT CTA TGG TAC GCC TAC GCG AAT CAA TTT 
Ile Val Asn Cys Asn Asp Pro Leu Trp Tyr Ala Tyr Ala Asn Gln Phe
245 250 255 

CAC TGG AGA GGA ATG CGC GTG CCG TCG CTT AAA TTA GCC TCT CCC CCG 
His Trp Arg Gly Met Arg Val Pro Ser Leu Lys Leu Ala Ser Pro Pro 
260 265 270 

GAG GAG AAT ATT CAA CAC GGC CCA ATG GCC GCC GTT TTT AGA AAC GCG 
Glu Glu Asn Ile Gln His Gly Pro Met Ala Ala Val Phe Arg Asn Ala
275 280 285 

GGG GCT GGT CTG TTC CTG TGG CCT GCC ATG CGC GCA GCC TTT GAA GAG 
Gly Ala Gly Leu Phe Leu Trp Pro Ala Met Arg Ala Ala Phe Glu Glu 
290 295 300 

CGC GAC AAG CGA CTG TTA AGA GCA TGC CTG TCT TCA CTC GAT ATC ATG 
Arg Asp Lys Arg Leu Leu Arg Ala Cys Leu Ser Ser Leu Asp Ile Met
305 310 315 320 

GAC GCA GCC GTC CTC GCG TCG TTT CCA TTT TAC TGG CGC GGC GTC CAA 
Asp Ala Ala Val Leu Ala Ser Phe Pro Phe Tyr Trp Arg Gly Val Gln
325 330 335 



GAC ACC TCG CGC TTC GAG CCT GCG CTG GGC TGT TTG TCA GAG TAC TTT 
Asp Thr Ser Arg Phe Glu Pro Ala Leu Gly Cys Leu Ser Glu Tyr Phe 
340 345 350 

GCA CTA GTG GTG TTA CTG GCC GAG ACG GTC TTA GCG ACC ATG TTC GAC 
Ala Leu Val Val Leu Leu Ala Glu Thr Val Leu Ala Thr Met Phe Asp 
355 360 365 

CAC GCA CTG GTA TTC ATG AGG GCG CTG GCA GAC GGC AAT TTC GAT GAC 
His Ala Leu Val Phe Met Arg Ala Leu Ala Asp Gly Asn Phe Asp Asp 
370 375 380 

TAT GAC GAA ACT AGA TAT ATA GAC CCC GTT AAA AAC GAG TAC CTG AAC 
Tyr Asp Glu Thr Arg Tyr Ile Asp Pro Val Lys Asn Glu Tyr Leu Asn
385 390 395 400 

GGA GCC GAG GGT ACT CTG TTA CGG GGC ATA GTG GCC TCC AAC ACC GCT 
Gly Ala Glu Gly Thr Leu Leu Arg Gly Ile Val Ala Ser Asn Thr Ala
405 410 415 

CTG GCG GTG GTT TGC GCA AAC ACC TAT TCG ACG ATA AGA AAA CTC CCG 
Leu Ala Val Val Cys Ala Asn Thr Tyr Ser Thr Ile Arg Lys Leu Pro
420 425 430 

TCC GTG GCA ACT AGC GCG TGC AAT GTT GCC TAC AGG ACC GAA ACG CTG 
Ser Val Ala Thr Ser Ala Cys Asn Val Ala Tyr Arg Thr Glu Thr Leu 
435 440 445 

AAA GCG AGG CGC CCT GGC ATG AGC GAC ATA TAC CGG ATA TTA CAA AAA 
Lys Ala Arg Arg Pro Gly Met Ser Asp Ile Tyr Arg Ile Leu Gln Lys
450 455 460

GAG TTT TTC TTT TAC ATT GCG TGG CTC CAG AGG GTT GCA ACA CAC GCA 
Glu Phe Phe Phe Tyr Ile Ala Trp Leu Gln Arg Val Ala Thr His Ala
465 470 475 480 

AAT TTC TGT TTA AAC ATT CTG AAG AGA AGC GTG GAT ACG GGC CCC CGC 
Asn Phe Cys Leu Asn Ile Leu Lys Arg Ser Val Asp Thr Gly Pro Arg
485 490 495 

CAT TTT TGT TCA GGG CCA GCT CGG AGA AGC GGC TGC AGC AGT TAAATAAA 
His Phe Cys Ser Gly Pro Ala Arg Arg Ser Gly Cys Ser Ser 
500 505 510 

ATG CTC TGC CCC CTT CTC GTG CCG ATT CAA TAT GAA GAC TTT TCG AAG 
Met Leu Cys Pro Leu Leu Val Pro Ile Gln Tyr Glu Asp Phe Ser Lys
1 5 10 15

GCC ATG GGG TCT GAG CTC AAG AGG GAA AAG TTA GAG ACA TTC GTT AAA 
Ala Met Gly Ser Glu Leu Lys Arg Glu Lys Leu Glu Thr Phe Val Lys 
20 25 30

GCT ATT TCC AGC GAC AGG GAC CCG AGG GGG TCC TTA AGA TTT CTC ATT 
Ala Ile Ser Ser Asp Arg Asp Pro Arg Gly Ser Leu Arg Phe Leu Ile
35 40 45 

TCG GAC CAT GCA AGG GAA ATT ATT GCA GAC GGA GTA CGG TTT AAG CCG 
Ser Asp His Ala Arg Glu Ile Ile Ala Asp Gly Val Arg Phe Lys Pro
50 55 60

GTG ATA GAC GAG CCG GTT CGG GCT TCA GTT GCG CTG AGT ACC GCT GCC 
Val Ile Asp Glu Pro Val Arg Ala Ser Val Ala Leu Ser Thr Ala Ala
65 70 75 80

GCT GGG AAA GTG AAA GCG CGA CGC TTA ACC TCA GTT CGC GCG CCC GTA 
Ala Gly Lys Val Lys Ala Arg Arg Leu Thr Ser Val Arg Ala Pro Val 



CCG CCC GCA GGC GCC GTT TCC GCG CGC CGG AAA TCG GAA ATA TGA TA 4447
Pro Pro Ala Gly Ala Val Ser Ala Arg Arg Lys Ser Glu Ile *
100 105 110 

AAAATGCTTG GCATTTGCGG GCGAAGAGGC GTGATCTGAA GGGCTCCACA ATGACGTAAC 4507

TGAGCTACGC ATCCCTATAA AGTGTACSCG CTGACCGCTA GCCCATACAG TGTTACAGGA 4567

GGGGAGAGAG ACAACTTCAG CTCGAAGTCT GAAGAGACAT C ATG AGC GGC 4617
Met Ser Gly
1 

TTC AGT AAC ATA GGA TCG ATT GCC ACC GTT TCC CTA GTA TGC TCG CTT 4665
Phe Ser Asn Ile Gly Ser Ile Ala Thr Val Ser Leu Val Cys Ser Leu
5 10 15 

TTG TGC GCA TCT GTA TTA GGG GCG CCG GTA CTG GAC GGG CTC GAG TCG 4713
Leu Cys Ala Ser Val Leu Gly Ala Pro Val Leu Asp Gly Leu Glu Ser
20 25 30 35 

AGC CCT TTC CCG TTC GGG GGC AAA ATT ATA GCC CAG GCG TGC AAC CGC 4761
Ser Pro Phe Pro Phe Gly Gly Lys Ile Ile Ala Gln Ala Cys Asn Arg
40 45 50 

ACC ACG ATT GAG GTG ACG GTC CCG TGG AGC GAC TAC TCT GGT CGC ACC 4809
Thr Thr Ile Glu Val Thr Val Pro Trp Ser Asp Tyr Ser Gly Arg Thr
55 60 65 

GAA GGA GTG TCA GTC GAG GTG AAA TGG TTC TAC GGG AAT AGT AAT CCC 4857 
Glu Gly Val Ser Val Glu Val Lys Trp Phe Tyr Gly Asn Ser Asn Pro 
70 75 80 

GAA AGC TTC GTG TTC GGG GTG GAT AGC GAA ACG GGC AGT GGA CAC GAG 4905 
Glu Ser Phe Val Phe Gly Val Asp Ser Glu Thr Gly Ser Gly His Glu 
85 90 95 

GAC CTG TCT ACG TGC TGG GCT CTA ATC CAT AAT CTG AAC GCG TCT GTG 4953
Asp Leu Ser Thr Cys Trp Ala Leu Ile His Asn Leu Asn Ala Ser Val
100 105 110 115 

TGC AGG GCG TCT GAC GCC GGG ATA CCT GAT TTC GAC AAG CAG TGC GAA 5001 
Cys Arg Ala Ser Asp Ala Gly Ile Pro Asp Phe Asp Lys Gln Cys Glu
120 125 130 

AAA GTG CAG AGA AGA CTG CGC TCC GGG GTG GAA CTT GGT AGT TAC GTG 5049
Lys Val Gln Arg Arg Leu Arg Ser Gly Val Glu Leu Gly Ser Tyr Val
135 140 145 

TCT GGC AAT GGA TCC CTG GTG CTG TAC CCA GGG ATG TAC GAT GCC GGC 5097 
Ser Gly Asn Gly Ser Leu Val Leu Tyr Pro Gly Met Tyr Asp Ala Gly 
150 155 160 

ATC TAC GCC TAC CAG CTC TCA GTG GGT GGG AAG GGA TAT ACC GGG TCT 5145 
Ile Tyr Ala Tyr Gln Leu Ser Val Gly Gly Lys Gly Tyr Thr Gly Ser
165 170 175 

GTT TAT CTA GAC GTC GGA CCA AAC CCC GGA TGC CAC GAC CAG TAT GGG 5193 
Val Tyr Leu Asp Val Gly Pro Asn Pro Gly Cys His Asp Gln Tyr Gly
180 185 190 195 

TAC ACC TAT TAC AGC CTG GCC GAC GAG GCG TCA GAC TTA TCA TCT TAT 5241 
Tyr Thr Tyr Tyr Ser Leu Ala Asp Glu Ala Ser Asp Leu Ser Ser Tyr 
200 205 210 



GAC GTA GCC TCG CCC GAA CTC GAC GGT CCT ATG GAG GAA GAT TAT TCC 
Asp Val Ala Ser Pro Glu Leu Asp Gly Pro Met Glu Glu Asp Tyr Ser 
215 220 225 

AAT TGT CTA GAC ATG CCC CCG CTA CGC CCA TGG ACA ACC GTT TGT TCG 5337
Asn Cys Leu Asp Met Pro Pro Leu Arg Pro Trp Thr Thr Val Cys Ser
230 235 240 

CAT GAC GTC GAG GAG CAG GAA AAC GCC ACG GAC GAG CTT TAC CTA TGG 5385
His Asp Val Glu Glu Gln Glu Asn Ala Thr Asp Glu Leu Tyr Leu Trp
245 250 255 

GAC GAG GAA TGC GCC GGT CCG CTG GAC GAG TAC GTC GAC GAA AGG TCA 5433
Asp Glu Glu Cys Ala Gly Pro Leu Asp Glu Tyr Val Asp Glu Arg Ser
260 265 270 275 

GAG ACG ATG CCC AGG ATG GTT GTC TTT TCA CCG CCC TCT ACG CTC CAG 5481
Glu Thr Met Pro Arg Met Val Val Phe Ser Pro Pro Ser Thr Leu Gln
280 285 290 

CAG TAGCCACCCG AGAGTGTTTT TTGTGAGCGC CCACGCAACA TACCTAACTG 5534 
Gln



CTTCATTTCT GATCAATTAT TGCGTATTGA ATAAATAAAC AGTACAAAAG CATCAGGTGT 5594

GGTTTGCGTG TCTGTGCTAA ACCATGGCGT GTGCGGGTGA AACCGTAAAT TACGTGATAA 5654

TAAATAGCAT AGGAGTTGGC GTGCAGCGTA TTTCGCCGAG AG ATG GGG ACA ATG 5708
Met Gly Thr Met
1 

TTA GTG TTG CGC CTT TTC CTA CTT GCA GTA GCG GAC GCG GCG TTG CCG 5756 
Leu Val Leu Arg Leu Phe Leu Leu Ala Val Ala Asp Ala Ala Leu Pro 
5 10 15 20 

ACC GGC AGA TTC TGC CGA GTT TGG AAG GTG CCT CCG GGA GGA ACC ATC 5804 
Thr Gly Arg Phe Cys Arg Val Trp Lys Val Pro Pro Gly Gly Thr Ile
25 30 35 

CAA GAG AAC CTG GCG GTG CTC GCG GAA TCG CCG GTC ACG GGA CAC GCG 5852
Gln Glu Asn Leu Ala Val Leu Ala Glu Ser Pro Val Thr Gly His Ala
40 45 50 

ACA TAT CCG CCG CCT GAA GGC GCC GTC AGC TTT CAG ATT TTT GCG GAC 5900 
Thr Tyr Pro Pro Pro Glu Gly Ala Val Ser Phe Gln Ile Phe Ala Asp
55 60 65 

ACC CCT ACT TTG CGC ATT CGC TAC GGG CCT ACG GAG GAC GAA CTT GCA 5948 
Thr Pro Thr Leu Arg Ile Arg Tyr Gly Pro Thr Glu Asp Glu Leu Ala
70 75 80 

CTG GAG CGC GGG ACG TCC GCC TCA GAC GCG GAC AAC GTG ACA TTT TCG 5996 
Leu Glu Arg Gly Thr Ser Ala Ser Asp Ala Asp Asn Val Thr Phe Ser 
85 90 95 100 

CTG TCA TAT CGC CCG CGC CCA GAA ATT CAC GGA GCA TAC TTC ACC ATA 6044
Leu Ser Tyr Arg Pro Arg Pro Glu Ile His Gly Ala Tyr Phe Thr Ile
105 110 115

GGG GTA TTC GCT ACT GGC CAG AGC ACG GAA AGC AGC TAT TCG GTC ATC 6092 
Gly Val Phe Ala Thr Gly Gln Ser Thr Glu Ser Ser Tyr Ser Val Ile
120 125 130 

AGT CGG GTC TTA GTT AAC GCC TCT CTG GAA CGG TCC GTG CGC CTG GAA 6140 
Ser Arg Val Leu Val Asn Ala Ser Leu Glu Arg Ser Val Arg Leu Glu 
135 140 145 

ACG CCG TGC GAT GAA AAT TTT TTG CAG AAC GAG CCT ACA TGG GGC TCG 6188 
Thr Pro Cys Asp Glu Asn Phe Leu Gln Asn Glu Pro Thr Trp Gly Ser
150 155 160 



AAG CGT TGG TTA GGC CCC CCG TCG CCT TAT GTG CGA GAT AAC GAT GTC 
Lys Arg Trp Leu Gly Pro Pro Ser Pro Tyr Val Arg Asp Asn Asp Val 
165 170 175 180 

GCC GTG TTG ACA AAA GCG CAG TAC ATT GGG GAG TGC TAC TCC AAC TCG 
Ala Val Leu Thr Lys Ala Gln Tyr Ile Gly Glu Cys Tyr Ser Asn Ser
185 190 195 

GCG GCC CAG ACG GGG CTC ACG TCT CTC AAC ATG ACC TTT TTC TAT TCG 
Ala Ala Gln Thr Gly Leu Thr Ser Leu Asn Met Thr Phe Phe Tyr Ser
200 205 210 

CCT AAA AGA ATA GTA AAC GTC ACG TGG ACA ACC GGC GGC CCC TCC CCC 
Pro Lys Arg Ile Val Asn Val Thr Trp Thr Thr Gly Gly Pro Ser Pro
215 220 225 

TCG CGC ATA ACG GTA TAC TCG TCG CGG GAG AAC GGG CAG CCC GTG TTG 
Ser Arg Ile Thr Val Tyr Ser Ser Arg Glu Asn Gly Gln Pro Val Leu
230 235 240 

AGG AAC GTT TCT GAC GGG TTC TTG GTT AAG TAC ACT CCC GAC ATT GAC 
Arg Asn Val Ser Asp Gly Phe Leu Val Lys Tyr Thr Pro Asp Ile Asp
245 250 255 260 

GGC CGG GCC ATG ATA AAC GTT ATT GCC AAT TAT TCG CCG GCG GAC TCC 
Gly Arg Ala Met Ile Asn Val Ile Ala Asn Tyr Ser Pro Ala Asp Ser
265 270 275 

GGC AGC GTC CTC GCG TTT ACG GCC TTT AGG GAA GGA AAA CTC CCA TCC 
Gly Ser Val Leu Ala Phe Thr Ala Phe Arg Glu Gly Lys Leu Pro Ser 
280 285 290 

GCG ATT CAA CTG CAC CGG ATA GAT ATG TCC GGG ACT GAG CCG CCG GGG 
Ala Ile Gln Leu His Arg Ile Asp Met Ser Gly Thr Glu Pro Pro Gly
295 300 305 

ACT GAA ACG ACC TTC GAC TGT CAA AAA ATG ATA GAA ACC CCG TAC CGA 
Thr Glu Thr Thr Phe Asp Cys Gln Lys Met Ile Glu Thr Pro Tyr Arg
310 315 320 

GCG CTC GGG AGC AAT GTT CCC AGG GAC GAC TCT ATC CGT CCG GGG GCC 
Ala Leu Gly Ser Asn Val Pro Arg Asp Asp Ser Ile Arg Pro Gly Ala
325 330 335 340 

ACT CTG CCT CCG TTC GAT ACC GCA GCA CCT GAT TTC GAT ACA GGT ACT 
Thr Leu Pro Pro Phe Asp Thr Ala Ala Pro Asp Phe Asp Thr Gly Thr 
345 350 355 

TCC CCG ACC CCC ACT ACC GTG CCA GAG CCA GCC ATT ACT ACA CTC ATA 
Ser Pro Thr Pro Thr Thr Val Pro Glu Pro Ala Ile Thr Thr Leu Ile
360 365 370 

CCG CGC AGC ACT AGC GAT ATG GGA TTC TTC TCC ACG GCA CGT GCT ACC 
Pro Arg Ser Thr Ser Asp Met Gly Phe Phe Ser Thr Ala Arg Ala Thr 
375 380 385 

GGA TCA GAA ACT CTT TCG GTA CCC GTC CAG GAA ACG GAT AGA ACT CTT 
Gly Ser Glu Thr Leu Ser Val Pro Val Gln Glu Thr Asp Arg Thr Leu
390 395 400 

TCG ACA ACT CCT CTT ACC CTT CCA CTG ACT CCC GGT GAG TCA GAA AAT 
Ser Thr Thr Pro Leu Thr Leu Pro Leu Thr Pro Gly Glu Ser Glu Asn 
405 410 415 420 

ACA CTG TTT CCT ACG ACC GCG CCG GGG ATT TCT ACC GAG ACC CCG AGC 
Thr Leu Phe Pro Thr Thr Ala Pro Gly Ile Ser Thr Glu Thr Pro Ser
425 430 435 



GCG GCA CAT GAA ACT ACA CAG ACC CAG AGT GCA GAA ACG GTG GTC TTT 
Ala Ala His Glu Thr Thr Gln Thr Gln Ser Ala Glu Thr Val Val Phe
440 445 450 

ACT CAG AGT CCG AGT ACC GAG TCG GAA ACC GCG CGG TCC CAG AGT CAG 
Thr Gln Ser Pro Ser Thr Glu Ser Glu Thr Ala Arg Ser Gln Ser Gln
455 460 465 

GAA CCG TGG TAT TTT ACT CAG ACT CCG AGT ACT GAA CAG GCG GCT CTT 
Glu Pro Trp Tyr Phe Thr Gln Thr Pro Ser Thr Glu Gln Ala Ala Leu
470 475 480 

ACT CAG ACG CAG ATC GCA GAA ACG GAG GCG TTG TTT ACT CAG ACT CCG 
Thr Gln Thr Gln Ile Ala Glu Thr Glu Ala Leu Phe Thr Gln Thr Pro
485 490 495 500 

AGT GCT GAA CAG ATG ACT TTT ACT CAG ACT CCG GGT GCA GAA ACC GAG 
Ser Ala Glu Gln Met Thr Phe Thr Gln Thr Pro Gly Ala Glu Thr Glu
505 510 515 

GCA CCT GCC CAG ACC CCG AGC ACG ATA CCC GAG ATA TTT ACT CAG TCT 
Ala Pro Ala Gln Thr Pro Ser Thr Ile Pro Glu Ile Phe Thr Gln Ser
520 525 530 

CGT AGC ACG CCC CCC GAA ACC GCT CGC GCT CCG AGC GCG GCG CCG GAG 
Arg Ser Thr Pro Pro Glu Thr Ala Arg Ala Pro Ser Ala Ala Pro Glu 
535 540 545 

GTT TTT ACA CAG AGT TCG AGT ACG GTA ACG GAG GTG TTT ACT CAG ACC 
Val Phe Thr Gln Ser Ser Ser Thr Val Thr Glu Val Phe Thr Gln Thr
550 555 560 

CCG AGC ACG GTA CCG AAA ACT ACT CTG AGT TCG AGT ACT GAA CCG GCG 
Pro Ser Thr Val Pro Lys Thr Thr Leu Ser Ser Ser Thr Glu Pro Ala 
565 570 575 580 

ATT TTT ACT CGG ACT CAG AGC GCG GGA ACT GAG GCC TTT ACT CAG ACT 
Ile Phe Thr Arg Thr Gln Ser Ala Gly Thr Glu Ala Phe Thr Gln Thr
585 590 595 

TCG AGT GCC GAG CCG GAC ACT ATG CGA ACT CAG AGT ACT GAA ACA CAC 
Ser Ser Ala Glu Pro Asp Thr Met Arg Thr Gln Ser Thr Glu Thr His
600 605 610 

TTT TTC ACT CAG GCC CCG AGT ACG GTA CCG AAA GCT ACT CAG ACT CCG 
Phe Phe Thr Gln Ala Pro Ser Thr Val Pro Lys Ala Thr Gln Thr Pro
615 620 625 

AGT ACA GAG CCG GAG GTG TTG ACT CAG AGT CCG AGT ACC GAA CCT GTG 
Ser Thr Glu Pro Glu Val Leu Thr Gln Ser Pro Ser Thr Glu Pro Val
630 635 640 

CCT TTC ACC CGG ACT CTG GGC GCA GAG CCG GAA ATT ACT CAG ACC CCG 
Pro Phe Thr Arg Thr Leu Gly Ala Glu Pro Glu Ile Thr Gln Thr Pro
645 650 655 660 

AGC GCG GCA CCG GAG GTT TAT ACT CGG AGT TCG AGT ACG ATG CCA GAA 
Ser Ala Ala Pro Glu Val Tyr Thr Arg Ser Ser Ser Thr Met Pro Glu 
665 670 675 

ACT GCA CAG AGC ACA CCC CTG GCC TCG CAA AAC CCT ACC AGT TCG GGA 
Thr Ala Gln Ser Thr Pro Leu Ala Ser Gln Asn Pro Thr Ser Ser Gly
680 685 690 

ACC GGG ACG CAT AAT ACT GAA CCG AGG ACT TAT CCA GTG CAA ACG ACA 
Thr Gly Thr His Asn Thr Glu Pro Arg Thr Tyr Pro Val Gln Thr Thr
695 700 705 



CCA CAT ACC CAG AAA CTC TAC ACA GAA AAT AAG ACT TTA TCG TTT CCT 
Pro His Thr Gln Lys Leu Tyr Thr Glu Asn Lys Thr Leu Ser Phe Pro
710 715 720 

ACT GTT GTT TCA GAA TTC CAT GAG ATG TCG ACG GCA GAG TCG CAG ACG 
Thr Val Val Ser Glu Phe His Glu Met Ser Thr Ala Glu Ser Gln Thr
725 730 735 740 

CCC CTA TTG GAC GTC AAA ATT GTA GAG GTG AAG TTT TCA AAC GAT GGC 
Pro Leu Leu Asp Val Lys Ile Val Glu Val Lys Phe Ser Asn Asp Gly
745 750 755 

GAA GTA ACG GCG ACT TGC GTT TCC ACC GTC AAA TCT CCC TAT AGG GTA 
Glu Val Thr Ala Thr Cys Val Ser Thr Val Lys Ser Pro Tyr Arg Val 
760 765 770 

GAA ACT AAT TGG AAA GTA GAC CTC GTA GAT GTA ATG GAT GAA ATT TCT 
Glu Thr Asn Trp Lys Val Asp Leu Val Asp Val Met Asp Glu Ile Ser
775 780 785 

GGG AAC AGT CCC GCC GGG GTT TTT AAC AGT AAT GAG AAA TGG CAG AAA 
Gly Asn Ser Pro Ala Gly Val Phe Asn Ser Asn Glu Lys Trp Gln Lys
790 795 800 

CAG CTG TAC TAC AGA GTA ACC GAT GGA AGA ACA TCG GTC CAG CTA ATG 
Gln Leu Tyr Tyr Arg Val Thr Asp Gly Arg Thr Ser Val Gln Leu Met
805 810 815 820 

TGC CTG TCG TGC ACG AGC CAT TCT CCG GAA CCT TAC TGT CTT TTC GAC 
Cys Leu Ser Cys Thr Ser His Ser Pro Glu Pro Tyr Cys Leu Phe Asp 
825 830 835 

ACG TCT CTT ATA GCG AGG GAA AAA GAT ATC GCG CCA GAG TTA TAC TTT 
Thr Ser Leu Ile Ala Arg Glu Lys Asp Ile Ala Pro Glu Leu Tyr Phe
840 845 850 

ACC TCT GAT CCG CAA ACG GCA TAC TGC ACA ATA ACT CTG CCG TCC GGC 
Thr Ser Asp Pro Gln Thr Ala Tyr Cys Thr Ile Thr Leu Pro Ser Gly
855 860 865 

GTT GTT CCG AGA TTC GAA TGG AGC CTT AAT AAT GTT TCA CTG CCG GAA 
Val Val Pro Arg Phe Glu Trp Ser Leu Asn Asn Val Ser Leu Pro Glu 
870 875 880 

TAT TTG ACG GCC ACG ACC GTT GTT TCG CAT ACC GCT GGC CAA AGT ACA 
Tyr Leu Thr Ala Thr Thr Val Val Ser His Thr Ala Gly Gln Ser Thr
885 890 895 900 

GTG TGG AAG AGC AGC GCG AGA GCA GGC GAG GCG TGG ATT TCT GGC CGG 
Val Trp Lys Ser Ser Ala Arg Ala Gly Glu Ala Trp Ile Ser Gly Arg
905 910 915 

GGA GGC AAT ATA TAC GAA TGC ACC GTC CTC ATC TCA GAC GGC ACT CGC 
Gly Gly Asn Ile Tyr Glu Cys Thr Val Leu Ile Ser Asp Gly Thr Arg
920 925 930 

GTT ACT ACG CGA AAG GAG AGG TGC TTA ACA AAC ACA TGG ATT GCG GTG 
Val Thr Thr Arg Lys Glu Arg Cys Leu Thr Asn Thr Trp Ile Ala Val
935 940 945 

GAA AAC GGT GCT GCT CAG GCG CAG CTG TAT TCA CTC TTT TCT GGA CTT 
Glu Asn Gly Ala Ala Gln Ala Gln Leu Tyr Ser Leu Phe Ser Gly Leu
950 955 960 

GTG TCA GGA TTA TGC GGG AGC ATA TCT GCT TTG TAC GCA ACG CTA TGG 
Val Ser Gly Leu Cys Gly Ser Ile Ser Ala Leu Tyr Ala Thr Leu Trp
965 970 975 980 



ACC GCC ATT TAT TTT TGAGGAATGC TTTTTGGACT ATCGTACTGC TTTCTTCCTT 8691
Thr Ala Ile Tyr Phe
985

CGCTAGCCAG AGCACCGCCG CCGTCACGTA CGACTACATT TTAGGCCGTC GCGCGCTCGA 8751

CGCGCTAACC ATACCGGCGG TTGGCCCGTA TAACAGATAC CTCACTAGGG TATCAAGAGG 8811

CTGCGACGTT GTCGAGCTCA ACCCGATTTC TAACGTGGAC GACATGATAT CGGCGGCCAA 8871

AGAAAAAGAG AAGGGGGGCC CTTTCGAGGC CTCCGTCGTC TGGTTCTACG TGATTAAGGG 8931

CGACGACGGC GAGGACAAGT ACTGTCCAAT CTATAGAAAA GAGTACAGGG AATGTGGCGA 8991

CGTACAACTG CTATCTGAAT GCGCCGTTCA ATCTGCACAG ATGTGGGCAG TGGACTATGT 9051 

TCCTAGCACC CTTGTATCGC GAAATGGCGC GGGACTGACT ATATTCTCCC CCACTGCTGC 9111 

GCTCTCTGGC CAATACTTGC TGACCCTGAA AATCGGGAGA TTTGCGCAAA CAGCTCTCGT 9171 

AACTCTAGAA GTTAACGATC GCTGTTTAAA GATCGGGTCG CAGCTTAACT TTTTACCGTC 9231 

GAAATGCTGG ACAACAGAAC AGTATCAGAC TGGATTTCAA GGCGAACACC TTTATCCGAT 9291 

CGCAGACACC AATACACGAC ACGCGGACGA CGTATATCGG GGATACGAAG ATATTCTGCA 9351

GCGCTGGAAT AATTTGCTGA GGAAAAAGAA TCCTAGCGCG CCAGACCCTC GTCCAGATAG 9411 

CGTCCCGCAA GAAATTCCCG CTGTAACCAA GAAAGCGGAA GGGCGCACCC CGGACGCAGA 9471 

AAGCAGCGAA AAGAAGGCCC CTCCAGAAGA CTCGGAGGAC GACATGCAGG CAGAGGCTTC 9531 

TGGAGAAAAT CCTGCCGCCC TCCCCGAAGA CGACGAAGTC CCCGAGGACA CCGAGCACGA 9591 

TGATCCAAAC TCGGATCCTG ACTATTACAA TGACATGCCC GCCGTGATCC CGGTGGAGGA 9651 

GACTACTAAA AGTTCTAATG CCGTCTCCAT GCCCATATTC GCGGCGTTCG TAGCCTGCGC 9711 

GGTCGCGCTC GTGGGGCTAC TGGTTTGGAG CATCGTAAAA TGCGCGCGTA GCTAATCGAG 9771 

CCTAGAATAG GTGGTTTCTT CCTACATGCC ACGCCTCACG CTCATAATAT AAATCACATG 9831 

GAATAGCATA CCAATGCCTA TTCATTGGGA CGTTCGAAAA GC 9873 

ATG GCA TCG CTA CTT GGA ACT 9894 
Met Ala Ser Leu Leu Gly Thr 
1 5 

CTG GCT CTC CTT GCC GCG ACG CTC GCA CCC TTC GGC GCG ATG GGA ATC 9942 
Leu Ala Leu Leu Ala Ala Thr Leu Ala Pro Phe Gly Ala Met Gly Ile
10 15 20 

GTG ATC ACT GGA AAT CAC GTC TCC GCC AGG ATT GAC GAC GAT CAC ATC 9990 
Val Ile Thr Gly Asn His Val Ser Ala Arg Ile Asp Asp Asp His Ile
25 30 35 

GTG ATC GTC GCG CCT CGC CCC GAA GCT ACA ATT CAA CTG CAG CTA TTT 10038 
Val Ile Val Ala Pro Arg Pro Glu Ala Thr Ile Gln Leu Gln Leu Phe
40 45 50 55 

TTC ATG CCT GGC CAG AGA CCC CAC AAA CCC TAC TCA GGA ACC GTC CGC 10086 
Phe Met Pro Gly Gln Arg Pro His Lys Pro Tyr Ser Gly Thr Val Arg
60 65 70 

GTC GCG TTT CGG TCT GAT ATA ACA AAC CAG TGC TAC CAG GAA CTT AGC 10134 
Val Ala Phe Arg Ser Asp Ile Thr Asn Gln Cys Tyr Gln Glu Leu Ser



GAG GAG CGC TTT GAA AAT TGC ACT CAT CGA TCG TCT TCT GTT TTT GTC 
Glu Glu Arg Phe Glu Asn Cys Thr His Arg Ser Ser Ser Val Phe Val 
90 95 100 

GGC TGT AAA GTG ACC GAG TAC ACG TTC TCC GCC TCG AAC AGA CTA ACC 
Gly Cys Lys Val Thr Glu Tyr Thr Phe Ser Ala Ser Asn Arg Leu Thr 
105 110 115

GGA CCT CCA CAC CCG TTT AAG CTC ACT ATA CGA AAT CCT CGT CCG AAC 
Gly Pro Pro His Pro Phe Lys Leu Thr Ile Arg Asn Pro Arg Pro Asn
120 125 130 135 

GAC AGC GGG ATG TTC TAC GTA ATT GTT CGG CTA GAC GAC ACC AAA GAA 
Asp Ser Gly Met Phe Tyr Val Ile Val Arg Leu Asp Asp Thr Lys Glu
140 145 150 

CCC ATT GAC GTC TTC GCG ATC CAA CTA TCG GTG TAT CAA TTC GCG AAC 
Pro Ile Asp Val Phe Ala Ile Gln Leu Ser Val Tyr Gln Phe Ala Asn
155 160 165 

ACC GCC GCG ACT CGC GGA CTC TAT TCC AAG GCT TCG TGT CGC ACC TTC 
Thr Ala Ala Thr Arg Gly Leu Tyr Ser Lys Ala Ser Cys Arg Thr Phe 
170 175 180 

GGA TTA CCT ACC GTC CAA CTT GAG GCC TAT CTC AGG ACC GAG GAA AGT 
Gly Leu Pro Thr Val Gln Leu Glu Ala Tyr Leu Arg Thr Glu Glu Ser
185 190 195 

TGG CGC AAC TGG CAA GCG TAC GTT GCC ACG GAG GCC ACG ACG ACC AGC 
Trp Arg Asn Trp Gln Ala Tyr Val Ala Thr Glu Ala Thr Thr Thr Ser
200 205 210 215 

GCC GAG GCG ACA ACC CCG ACG CCC GTC ACT GCA ACC AGC GCC TCC GAA 
Ala Glu Ala Thr Thr Pro Thr Pro Val Thr Ala Thr Ser Ala Ser Glu 
220 225 230 

CTT GAA GCG GAA CAC TTT ACC TTT CCC TGG CTA GAA AAT GGC GTG GAT 
Leu Glu Ala Glu His Phe Thr Phe Pro Trp Leu Glu Asn Gly Val Asp 
235 240 245 

CAT TAC GAA CCG ACA CCC GCA AAC GAA AAT TCA AAC GTT ACT GTC CGT 
His Tyr Glu Pro Thr Pro Ala Asn Glu Asn Ser Asn Val Thr Val Arg 
250 255 260 

CTC GGG ACA ATG AGC CCT ACG CTA ATT GGG GTA ACC GTG GCT GCC GTC 
Leu Gly Thr Met Ser Pro Thr Leu Ile Gly Val Thr Val Ala Ala Val
265 270 275 

GTG AGC GCA ACG ATC GGC CTC GTC ATT GTA ATT TCC ATC GTC ACC AGA 
Val Ser Ala Thr Ile Gly Leu Val Ile Val Ile Ser Ile Val Thr Arg
280 285 290 295 

AAC ATG TGC ACC CCG CAC CGA AAA TTA GAC ACG GTC TCG CAA GAC GAC 
Asn Met Cys Thr Pro His Arg Lys Leu Asp Thr Val Ser Gln Asp Asp
300 305 310 

GAA GAA CGT TCC CAA ACT AGA AGG GAA TCG CGA AAA TTT GGA CCC ATG 
Glu Glu Arg Ser Gln Thr Arg Arg Glu Ser Arg Lys Phe Gly Pro Met
315 320 325 

GTT GCG TGC GAA ATA AAC AAG GGC GCT GAC CAG GAT AGT GAA CTT GTG 
Val Ala Cys Glu Ile Asn Lys Gly Ala Asp Gln Asp Ser Glu Leu Val
330 335 340 

GAA CTG GTT GCG ATT GTT AAC CCG TCT GCG CTA AGC TCG CCC GAC TCA 
Glu Leu Val Ala Ile Val Asn Pro Ser Ala Leu Ser Ser Pro Asp Ser
345 350 355 






ATA AAA ATG TGATTAAGTC TGAATGTGGC TCTCCAATCA TTTCGATTCT 
Ile Lys Met



CTAATCTCCC AATCCTCTCA AAAGGGGCAG TATCGGACAC GGACTGGGAG GGGCGTACTA 11059

CACGATAGTT ATATGGTACA GCAGAGGCCT CTGAACACTT AGGAGGAGAA TTCAGCCGGG 11119

11173



ATA GTT CTC GCC TCT TGT CTT GCG CGC CTA ACT TTT GCG ACG CGA CAC 
Ile Val Leu Ala Ser Cys Leu Ala Arg Leu Thr Phe Ala Thr Arg His
10 15 20 

GTC CTC TTT TTG GAA GGC ACT CAG GCT GTC CTC GGG GAA GAT GAT CCC 
Val Leu Phe Leu Glu Gly Thr Gln Ala Val Leu Gly Glu Asp Asp Pro
25 30 35 

AGA AAC GTT CCG GAA GGG ACT GTA ATC AAA TGG ACA AAA GTC CTG CGG 
Arg Asn Val Pro Glu Gly Thr Val Ile Lys Trp Thr Lys Val Leu Arg
40 45 50 

AAC GCG TGC AAG ATG AAG GCG GCC GAT GTC TGC TCT TCG CCT AAC TAT 
Asn Ala Cys Lys Met Lys Ala Ala Asp Val Cys Ser Ser Pro Asn Tyr 
55 60 65 

TGC TTT CAT GAT TTA ATT TAC GAC GGA GGA AAG AAA GAC TGC CCG CCC 
Cys Phe His Asp Leu Ile Tyr Asp Gly Gly Lys Lys Asp Cys Pro Pro



GCG GGA CCC CTG TCT GCA AAC CTG GTA ATT TTA CTA AAG CGC GGC GAA 
Ala Gly Pro Leu Ser Ala Asn Leu Val Ile Leu Leu Lys Arg Gly Glu
90 95 100 

AGC TTC GTC GTG CTG GGT TCT GGG CTA CAC AAC AGC AAT ATA ACT AAT 
Ser Phe Val Val Leu Gly Ser Gly Leu His Asn Ser Asn Ile Thr Asn
105 110 115 

ATC ATG TGG ACA GAG TAC GGA GGC CTG CTC TTT GAT CCT GTA ACT CGT 
Ile Met Trp Thr Glu Tyr Gly Gly Leu Leu Phe Asp Pro Val Thr Arg
120 125 130 

TCG GAC GAG GGA ATC TAT TTT CGA CGG ATC TCT CAG CCA GAT CTG GCC 
Ser Asp Glu Gly Ile Tyr Phe Arg Arg Ile Ser Gln Pro Asp Leu Ala
135 140 145 

ATG GAA ACT ACA TCG TAC AAC GTC AGC GTT CTT TCG CAC GTA GAC GAG 
Met Glu Thr Thr Ser Tyr Asn Val Ser Val Leu Ser His Val Asp Glu 
150 155 160 165 

AAG GCT CCA GCA CCG CAC GAG GTG GAG ATA GAC ACC ATC AAG CCG TCA 
Lys Ala Pro Ala Pro His Glu Val Glu Ile Asp Thr Ile Lys Pro Ser
170 175 180 

GAG GCC CAC GCG CAC GTG GAA TTA CAA ATG CTG CCG TTT CAT GAA CTC 
Glu Ala His Ala His Val Glu Leu Gln Met Leu Pro Phe His Glu Leu
185 190 195 

AAC GAC AAC AGC CCC ACC TAT GTG ACC CCT GTT CTT AGA GTC TTC CCA 
Asn Asp Asn Ser Pro Thr Tyr Val Thr Pro Val Leu Arg Val Phe Pro 
200 205 210 

CCG ACC GAG CAC GTA AAA TTT AAC GTT ACG TAT TCG TGG TAT GGG TTT 
Pro Thr Glu His Val Lys Phe Asn Val Thr Tyr Ser Trp Tyr Gly Phe 
215 220 225 






GAT GTC AAA GAG GAG TGC GAA GAA GTG AAA CTG TTC GAG CCG TGC GTA 
Asp Val Lys Glu Glu Cys Glu Glu Val Lys Leu Phe Glu Pro Cys Val 
230 235 240 245 

TAC CAT CCT ACA GAC GGC AAA TGT CAG TTT CCC GCA ACC AAC CAG AGA 
Tyr His Pro Thr Asp Gly Lys Cys Gln Phe Pro Ala Thr Asn Gln Arg
250 255 260 

TGC CTC ATA GGA TCT GTC TTG ATG GCG GAA TTC TTG GGC GCG GCC TCT 
Cys Leu Ile Gly Ser Val Leu Met Ala Glu Phe Leu Gly Ala Ala Ser
265 270 275 

TTG CTG GAT TGT TCC CGC GAT ACT CTA GAA GAC TGC CAC GAA AAT CGC 
Leu Leu Asp Cys Ser Arg Asp Thr Leu Glu Asp Cys His Glu Asn Arg 
280 285 290 

GTG CCG AAC CTA CGG TTC GAT TCG CGA CTC TCC GAG TCA CGC GCA GGC 
Val Pro Asn Leu Arg Phe Asp Ser Arg Leu Ser Glu Ser Arg Ala Gly 
295 300 305 

CTG GTG ATC AGT CCT CTT ATA GCC ATC CCC AAA GTT TTG ATT ATA GTC 
Leu Val Ile Ser Pro Leu Ile Ala Ile Pro Lys Val Leu Ile Ile Val
310 315 320 325 

GTT TCC GAC GGA GAC ATT TTG GGA TGG AGC TAC ACG GTG CTC GGG AAA 
Val Ser Asp Gly Asp Ile Leu Gly Trp Ser Tyr Thr Val Leu Gly Lys
330 335 340 

CGT AAC AGT CCG CGC GTA GTA GTC GAA ACG CAC ATG CCC TCG AAG GTC 
Arg Asn Ser Pro Arg Val Val Val Glu Thr His Met Pro Ser Lys Val 
345 350 355 

CCG ATG AAC AAA GTA GTA ATT GGC AGT CCC GGA CCA ATG GAC GAA ACG 
Pro Met Asn Lys Val Val Ile Gly Ser Pro Gly Pro Met Asp Glu Thr
360 365 370 

GGT AAC TAT AAA ATG TAC TTC GTC GTC GCG GGG GTG GCC GCG ACG TGC 
Gly Asn Tyr Lys Met Tyr Phe Val Val Ala Gly Val Ala Ala Thr Cys 
375 380 385 

GTA ATT CTT ACA TGC GCT CTG CTT GTG GGG AAA AAG AAG TGC CCC GCG 
Val Ile Leu Thr Cys Ala Leu Leu Val Gly Lys Lys Lys Cys Pro Ala
390 395 400 405 

CAC CAA ATG GGT ACT TTT TCC AAG ACC GAA CCA TTG TAC GCG CCG CTC 
His Gln Met Gly Thr Phe Ser Lys Thr Glu Pro Leu Tyr Ala Pro Leu
410 415 420 

CCC AAA AAC GAG TTT GAG GCC GGC GGG CTT ACG GAC GAT GAG GAA GTG 
Pro Lys Asn Glu Phe Glu Ala Gly Gly Leu Thr Asp Asp Glu Glu Val 
425 430 435 

ATT TAT GAC GAA GTA TAC GAA CCC CTA TTT CGC GGC TAC TGT AAG CAG 
Ile Tyr Asp Glu Val Tyr Glu Pro Leu Phe Arg Gly Tyr Cys Lys Gln
440 445 450 

GAA TTC CGC GAA GAT GTG AAT ACC TTT TTC GGT GCG GTC GTG GAG GGA 
Glu Phe Arg Glu Asp Val Asn Thr Phe Phe Gly Ala Val Val Glu Gly 
455 460 465 

GAA AGG GCC TTA AAC TTT AAA TCC GCC ATC GCA TCA ATG GCA GAT CGC 
Glu Arg Ala Leu Asn Phe Lys Ser Ala Ile Ala Ser Met Ala Asp Arg
470 475 480 485 



ATC CTG GCA AAT AAA AGC GGC AGA AGG AAT ATG GAT AGC TAT TAGTTGGTC 12664 
Ile Leu Ala Asn Lys Ser Gly Arg Arg Asn Met Asp Ser Tyr

490 495 500 



ATG CCT TTT AAG ACC AGA GGG GCC GAA GAC 
Met Pro Phe Lys Thr Arg Gly Ala Glu Asp 
1 5 10

GCG GCC GCG GGC AAG AAC AGG TTT AAG AAA TCG AGA AAT CGG GAA ATC 
Ala Ala Ala Gly Lys Asn Arg Phe Lys Lys Ser Arg Asn Arg Glu Ile
15 20 25 

TTA CCG ACC AGA CTG CGT GGC ACC GGT AAG AAA ACT GCC GGA TTG TCC 
Leu Pro Thr Arg Leu Arg Gly Thr Gly Lys Lys Thr Ala Gly Leu Ser 
30 35 40 

AAT TAT ACC CAG CCT ATT CCC TGG AAC CCT AAA TTC TGC AGC GCG CGC 
Asn Tyr Thr Gln Pro Ile Pro Trp Asn Pro Lys Phe Cys Ser Ala Arg
45 50 55 

GGG GAA TCT GAC AAC CAC GCG TGT AAA GAC ACT TTT TAT CGC AGG ACG 
Gly Glu Ser Asp Asn His Ala Cys Lys Asp Thr Phe Tyr Arg Arg Thr 
60 65 70 

TGC TGC GCA TCG CGC TCT ACC GTT TCC AGT CAA CCC GAT TCC CCC CAC 
Cys Cys Ala Ser Arg Ser Thr Val Ser Ser Gln Pro Asp Ser Pro His
75 80 85 90 

ACA CCC ATG CCT ACT GAG TAT GGG CGC GTG CCC TCC GCA AAG CGC AAA 
Thr Pro Met Pro Thr Glu Tyr Gly Arg Val Pro Ser Ala Lys Arg Lys 
95 100 105 

AAA CTA TCA TCT TCA GAC TSS GAG GGC GCG CAC CAA CCC CTA GTA TCC 
Lys Leu Ser Ser Ser Asp Xaa Glu Gly Ala His Gln Pro Leu Val Ser
110 115 120 

TGT AAA CTT CCG GAT TCT CAA GCA GCA CCG GCG CGA ACC TAT AGT TCT 
Cys Lys Leu Pro Asp Ser Gln Ala Ala Pro Ala Arg Thr Tyr Ser Ser
125 130 135 

GCG CAA AGA TAT ACT GTT GAC GAG GTT TCG TCG CCA ACT CCG CCA GGC 
Ala Gln Arg Tyr Thr Val Asp Glu Val Ser Ser Pro Thr Pro Pro Gly
140 145 150 

GTC GAC GCT GTT GCG GAC TTA GAA ACG CGC GCG GAA CTT CCT GGC GCT 
Val Asp Ala Val Ala Asp Leu Glu Thr Arg Ala Glu Leu Pro Gly Ala 
155 160 165 170 

ACG ACG GAA CAA ACG GAA AGT AAA AAT AAG CTC CCC AAC CAA CAA TCG 
Thr Thr Glu Gln Thr Glu Ser Lys Asn Lys Leu Pro Asn Gln Gln Ser
175 180 185 

CGC CTG AAG CCG AAA CCC ACA AAC GAG CAC GTC GGA GGG GAG CGG TGC 
Arg Leu Lys Pro Lys Pro Thr Asn Glu His Val Gly Gly Glu Arg Cys 
190 195 200 

CCC TCC GAA GGC ACG GTC GAG GCG CCA TCG CTC GGC ATC CTC TCG CGC 
Pro Ser Glu Gly Thr Val Glu Ala Pro Ser Leu Gly Ile Leu Ser Arg
205 210 215 

GTC GGG GCA GCG ATA GCA AAC GAG CTG GCT CGT ATG CGG AGG GCG TGT 
Val Gly Ala Ala Ile Ala Asn Glu Leu Ala Arg Met Arg Arg Ala Cys
220 225 230 

CTT CCG CTC GCC GCG TCG GCG GCC GCT GCC GGA ATA GTG GCC TGG GCC 
Leu Pro Leu Ala Ala Ser Ala Ala Ala Ala Gly Ile Val Ala Trp Ala
235 240 245 250 

GCG GCG AGG GCC TTG CAG AAA CAA GGG CGG TAG CAG T AAT AAT A AC CAC AC AA 
Ala Ala Arg Ala Leu Gln Lys Gln Gly Arg *
255 260 



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Phe Arg Arg Ile Cys Ser Arg Ser Arg Ala Glu Lys Arg Arg
1 5 10 15 

Arg Thr Thr Glu Asn Pro Leu Thr Ser Lys Arg Val Cys Val Leu Asp 
20 25 30 

Ser Phe Ser Arg Thr Met Ser Leu Arg Pro Tyr Ala Glu Ile Leu Pro
35 40 45 

Thr Ala Glu Gly Val Glu Arg Leu Ala Glu Leu Val Ser Val Thr Met 
50 55 60 

Thr Glu Arg Ala Glu Pro Val Thr Glu Asn Thr Ala Val Asn Ser Ile
65 70 75 80 

Pro Pro Ala Asn Glu Asn Gly Gln Asn Phe Ala Tyr Ala Gly Asp Gly
85 90 95

Pro Ser Thr Thr Glu Lys Val Asp Gly Ser His Thr Asp Phe Asp Glu 
100 105 110 

Ala Ser Ser Asp Tyr Ala Gly Pro Val Pro Leu Ala Gin Thr Arg Leu 
115 120 125 

Lys His Ser Asp Glu Phe Leu Gin His Phe Arg Val Leu Asp Asp Leu 
130 135 140 

Val Glu Gly Ala Tyr Gly Phe He Cys Gly Val Arg Arg Tyr Thr Glu 
145 150 155 160 

Glu Glu Gin Arg Arg Arg Gly Val Asn Ser Thr Asn Gin Gly Lys Ser 
165 170 175 

Lys Cys Lys Arg Leu He Ala Lys Tyr Val Lys Asn Gly Thr Arg Ala 
180 185 190 

Ala Ser Gin Leu Glu Asn Glu He Leu Val Leu Gly Arg Leu Asn His 
195 200 205 

Glu Asn Val Leu Lys He Gin Glu He Leu Arg Tyr Pro Asp Asn Thr 
210 215 220 

Tyr Met Leu Thr Gin Arg Tyr Gin Phe Asp Leu Tyr Ser Tyr Met Tyr 
225 230 235 240 

Asp Glu Ala Phe Asp Trp Lys Asp Ser Pro Met Leu Lys Gln Thr Arg
245 250 255

Arg He Met Lys Gin Leu Met Ser Ala Val Ser Tyr He His Ser Lys 
260 265 270 



Lys Leu He His Arg Asp He Lys Leu Glu Asn He Phe Leu Asn Cys 
275 280 285 

Asp Gly Lys Thr Val Leu Gly Asp Phe Gly Thr Val Thr Pro Phe Glu 
290 295 300 

Asn Glu Arg Glu Pro Phe Glu Tyr Gly Trp Val Gly Thr Val Ala Thr 
305 310 315 320 

Asn Ser Pro Glu He Leu Ala Arg Asp Ser Tyr Cys Glu He Thr Asp 
325 330 335 

He Trp Ser Cys Gly Val Val Leu Leu Glu Met Val Ser His Glu Phe 
340 345 350 

Cys Pro He Gly Asp Gly Gly Gly Asn Pro His Gin Gin Leu Leu Lys 
355 360 365 

Val He Asp Ser Leu Ser Val Cys Asp Glu Glu Phe Pro Asp Pro Pro 
370 375 380 

Cys Asn Leu Tyr Asn Tyr Leu His Tyr Ala Ser He Asp Arg Ala Gly 
385 390 395 400 

His Thr Val Pro Ser Leu He Arg Asn Leu His Leu Pro Ala Asp Val 
405 410 415 

Glu Tyr Pro Leu Val Lys Met Leu Thr Phe Asp Trp Arg Leu Arg Pro 
420 425 430 

Ser Ala Ala Glu Val Leu Ala Met Pro Leu Phe Ser Ala Glu Glu Glu 
435 440 445 

Arg Thr He Thr He He His Gly Lys His Lys Pro He Arg Pro Glu 
450 455 460 

He Arg Ala Arg Val Pro Arg Ser Met Ser Glu Gly 
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Thr Leu Pro His Arg Leu Thr Lys Arg Pro Phe Ala Arg Arg Phe 
1 5 10 15

Cys Ser Val Phe Val He His Tyr Ser Glu Thr Lys Leu Asp Arg Tyr 
20 25 30 

Asn Lys Thr Met Leu Leu Tyr Arg Pro Asp Ser Thr Met Arg His Ser 
35 40 45 

Gly Gly Asp Ala Asn His Arg Gly He Arg Pro Arg Arg Lys Ser He 
50 55 60 

Gly Ala Phe Ser Ala Arg Glu Lys Thr Gly Lys Arg Asn Ala Leu Thr 
65 70 75 80 

Glu Ser Ser Ser Ser Ser Asp Met Leu Asp Pro Phe Ser Thr Asp Lys
85 90 95 

Glu Phe Gly Gly Lys Trp Thr Val Asp Gly Pro Ala Asp Ile Thr Ala
100 105 110 

Glu Val Leu Ser Gin Ala Trp Asp Val Leu Gin Leu Val Lys His Glu 
115 120 125 

Asp Ala Glu Glu Glu Arg Val Thr Tyr Glu Ser Lys Pro Thr Pro Ile
130 135 140 

Gin Pro Phe Asn Ala Trp Pro Asp Gly Pro Ser Trp Asn Ala Gin Asp 
145 150 155 160 

Phe Thr Arg Ala Pro He Val Tyr Pro Ser Ala Glu Val Leu Asp Ala 
165 170 175 

Glu Ala Leu Lys Val Gly Ala Phe Val Ser Arg Val Leu Gin Cys Val 
180 185 190 

Pro Phe Thr Arg Ser Lys Lys Ser Val Thr Val Arg Asp Ala Gin Ser 
195 200 205 

Phe Leu Gly Asp Ser Phe Trp Arg He Met Gin Asn Val Tyr Thr Val 
210 215 220 

Cys Leu Arg Gin His He Thr Arg Leu Arg His Pro Ser Ser Lys Ser 
225 230 235 240 

He Val Asn Cys Asn Asp Pro Leu Trp Tyr Ala Tyr Ala Asn Gin Phe 
245 250 255 

His Trp Arg Gly Met Arg Val Pro Ser Leu Lys Leu Ala Ser Pro Pro 
260 265 270 

Glu Glu Asn He Gin His Gly Pro Met Ala Ala Val Phe Arg Asn Ala 
275 280 285 

Gly Ala Gly Leu Phe Leu Trp Pro Ala Met Arg Ala Ala Phe Glu Glu 
290 295 300 

Arg Asp Lys Arg Leu Leu Arg Ala Cys Leu Ser Ser Leu Asp He Met 
305 310 315 320 

Asp Ala Ala Val Leu Ala Ser Phe Pro Phe Tyr Trp Arg Gly Val Gin 
325 330 335 

Asp Thr Ser Arg Phe Glu Pro Ala Leu Gly Cys Leu Ser Glu Tyr Phe 
340 345 350 

Ala Leu Val Val Leu Leu Ala Glu Thr Val Leu Ala Thr Met Phe Asp 
355 360 365 

His Ala Leu Val Phe Met Arg Ala Leu Ala Asp Gly Asn Phe Asp Asp 
370 375 380 

Tyr Asp Glu Thr Arg Tyr He Asp Pro Val Lys Asn Glu Tyr Leu Asn 
385 390 395 400 

Gly Ala Glu Gly Thr Leu Leu Arg Gly He Val Ala Ser Asn Thr Ala 
405 410 415 

Leu Ala Val Val Cys Ala Asn Thr Tyr Ser Thr He Arg Lys Leu Pro 
420 425 430 



Ser Val Ala Thr Ser Ala Cys Asn Val Ala Tyr Arg Thr Glu Thr Leu 
435 440 445 

Lys Ala Arg Arg Pro Gly Met Ser Asp Ile Tyr Arg Ile Leu Gln Lys
450 455 460 

Glu Phe Phe Phe Tyr He Ala Trp Leu Gin Arg Val Ala Thr His Ala 
465 470 475 480 

Asn Phe Cys Leu Asn He Leu Lys Arg Ser Val Asp Thr Gly Pro Arg 
485 490 495 

His Phe Cys Ser Gly Pro Ala Arg Arg Ser Gly Cys Ser Ser 
500 505 510 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Leu Cys Pro Leu Leu Val Pro Ile Gln Tyr Glu Asp Phe Ser Lys
1 5 10 15

Ala Met Gly Ser Glu Leu Lys Arg Glu Lys Leu Glu Thr Phe Val Lys 
20 25 30 

Ala Ile Ser Ser Asp Arg Asp Pro Arg Gly Ser Leu Arg Phe Leu Ile
35 40 45 

Ser Asp His Ala Arg Glu Ile Ile Ala Asp Gly Val Arg Phe Lys Pro
50 55 60 

Val Ile Asp Glu Pro Val Arg Ala Ser Val Ala Leu Ser Thr Ala Ala
65 70 75 80 

Ala Gly Lys Val Lys Ala Arg Arg Leu Thr Ser Val Arg Ala Pro Val 
85 90 95 

Pro Pro Ala Gly Ala Val Ser Ala Arg Arg Lys Ser Glu Ile
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Gly Phe Ser Asn Ile Gly Ser Ile Ala Thr Val Ser Leu Val
1 5 10 15

Cys Ser Leu Leu Cys Ala Ser Val Leu Gly Ala Pro Val Leu Asp Gly 
20 25 30

Leu Glu Ser Ser Pro Phe Pro Phe Gly Gly Lys Ile Ile Ala Gln Ala
35 40 45 

Cys Asn Arg Thr Thr He Glu Val Thr Val Pro Trp Ser Asp Tyr Ser 
50 55 60 

Gly Arg Thr Glu Gly Val Ser Val Glu Val Lys Trp Phe Tyr Gly Asn
65 70 75 80 

Ser Asn Pro Glu Ser Phe Val Phe Gly Val Asp Ser Glu Thr Gly Ser 
85 90 95 

Gly His Glu Asp Leu Ser Thr Cys Trp Ala Leu He His Asn Leu Asn 
100 105 110 

Ala Ser Val Cys Arg Ala Ser Asp Ala Gly He Pro Asp Phe Asp Lys 
115 120 125 

Gin Cys Glu Lys Val Gin Arg Arg Leu Arg Ser Gly Val Glu Leu Gly 
130 135 140 

Ser Tyr Val Ser Gly Asn Gly Ser Leu Val Leu Tyr Pro Gly Met Tyr 
145 150 155 160 

Asp Ala Gly He Tyr Ala Tyr Gin Leu Ser Val Gly Gly Lys Gly Tyr 
165 170 175 

Thr Gly Ser Val Tyr Leu Asp Val Gly Pro Asn Pro Gly Cys His Asp 
180 185 190 

Gin Tyr Gly Tyr Thr Tyr Tyr Ser Leu Ala Asp Glu Ala Ser Asp Leu 
195 200 205 

Ser Ser Tyr Asp Val Ala Ser Pro Glu Leu Asp Gly Pro Met Glu Glu 
210 215 220 

Asp Tyr Ser Asn Cys Leu Asp Met Pro Pro Leu Arg Pro Trp Thr Thr 
225 230 235 240 

Val Cys Ser His Asp Val Glu Glu Gin Glu Asn Ala Thr Asp Glu Leu 
245 250 255 

Tyr Leu Trp Asp Glu Glu Cys Ala Gly Pro Leu Asp Glu Tyr Val Asp 
260 265 270 

Glu Arg Ser Glu Thr Met Pro Arg Met Val Val Phe Ser Pro Pro Ser 
275 280 285 

Thr Leu Gln Gln
290 



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Thr Met Leu Val Leu Arg Leu Phe Leu Leu Ala Val Ala Asp 
1 5 10 15

Ala Ala Leu Pro Thr Gly Arg Phe Cys Arg Val Trp Lys Val Pro Pro
20 25 30 

Gly Gly Thr Ile Gln Glu Asn Leu Ala Val Leu Ala Glu Ser Pro Val
35 40 45 

Thr Gly His Ala Thr Tyr Pro Pro Pro Glu Gly Ala Val Ser Phe Gin 
50 55 60 

He Phe Ala Asp Thr Pro Thr Leu Arg He Arg Tyr Gly Pro Thr Glu 
65 70 75 80 

Asp Glu Leu Ala Leu Glu Arg Gly Thr Ser Ala Ser Asp Ala Asp Asn 
85 90 95 

Val Thr Phe Ser Leu Ser Tyr Arg Pro Arg Pro Glu Ile His Gly Ala
100 105 110

Tyr Phe Thr He Gly Val Phe Ala Thr Gly Gin Ser Thr Glu Ser Ser 
115 120 125 

Tyr Ser Val He Ser Arg Val Leu Val Asn Ala Ser Leu Glu Arg Ser 
130 135 140 

Val Arg Leu Glu Thr Pro Cys Asp Glu Asn Phe Leu Gin Asn Glu Pro 
145 150 155 160 

Thr Trp Gly Ser Lys Arg Trp Leu Gly Pro Pro Ser Pro Tyr Val Arg 
165 170 175 

Asp Asn Asp Val Ala Val Leu Thr Lys Ala Gin Tyr He Gly Glu Cys 
180 185 190 

Tyr Ser Asn Ser Ala Ala Gin Thr Gly Leu Thr Ser Leu Asn Met Thr 
195 200 205 

Phe Phe Tyr Ser Pro Lys Arg He Val Asn Val Thr Trp Thr Thr Gly 
210 215 220 

Gly Pro Ser Pro Ser Arg He Thr Val Tyr Ser Ser Arg Glu Asn Gly 
225 230 235 240 

Gin Pro Val Leu Arg Asn Val Ser Asp Gly Phe Leu Val Lys Tyr Thr 
245 250 255 

Pro Asp He Asp Gly Arg Ala Met He Asn Val He Ala Asn Tyr Ser 
260 265 270 

Pro Ala Asp Ser Gly Ser Val Leu Ala Phe Thr Ala Phe Arg Glu Gly 
275 280 285 

Lys Leu Pro Ser Ala He Gin Leu His Arg He Asp Met Ser Gly Thr 
290 295 300 

Glu Pro Pro Gly Thr Glu Thr Thr Phe Asp Cys Gin Lys Met He Glu 
305 310 315 320 

Thr Pro Tyr Arg Ala Leu Gly Ser Asn Val Pro Arg Asp Asp Ser Ile
325 330 335 

Arg Pro Gly Ala Thr Leu Pro Pro Phe Asp Thr Ala Ala Pro Asp Phe 
340 345 350 

Asp Thr Gly Thr Ser Pro Thr Pro Thr Thr Val Pro Glu Pro Ala He 
355 360 365 



Thr Thr Leu He Pro Arg Ser Thr Ser Asp Met Gly Phe Phe Ser Thr 
370 375 380 

Ala Arg Ala Thr Gly Ser Glu Thr Leu Ser Val Pro Val Gln Glu Thr
385 390 395 400 

Asp Arg Thr Leu Ser Thr Thr Pro Leu Thr Leu Pro Leu Thr Pro Gly 
405 410 415 

Glu Ser Glu Asn Thr Leu Phe Pro Thr Thr Ala Pro Gly He Ser Thr 
420 425 430 

Glu Thr Pro Ser Ala Ala His Glu Thr Thr Gin Thr Gin Ser Ala Glu 
435 440 445 

Thr Val Val Phe Thr Gin Ser Pro Ser Thr Glu Ser Glu Thr Ala Arg 
450 455 460 

Ser Gin Ser Gin Glu Pro Trp Tyr Phe Thr Gin Thr Pro Ser Thr Glu 
465 470 475 480 

Gin Ala Ala Leu Thr Gin Thr Gin He Ala Glu Thr Glu Ala Leu Phe 
485 490 495 

Thr Gin Thr Pro Ser Ala Glu Gin Met Thr Phe Thr Gin Thr Pro Gly 
500 505 510 

Ala Glu Thr Glu Ala Pro Ala Gin Thr Pro Ser Thr He Pro Glu He 
515 520 525 

Phe Thr Gin Ser Arg Ser Thr Pro Pro Glu Thr Ala Arg Ala Pro Ser 
530 535 540 

Ala Ala Pro Glu Val Phe Thr Gin Ser Ser Ser Thr Val Thr Glu Val 
545 550 555 560 

Phe Thr Gin Thr Pro Ser Thr Val Pro Lys Thr Thr Leu Ser Ser Ser 
565 570 575 

Thr Glu Pro Ala He Phe Thr Arg Thr Gin Ser Ala Gly Thr Glu Ala 
580 585 590 

Phe Thr Gin Thr Ser Ser Ala Glu Pro Asp Thr Met Arg Thr Gin Ser 
595 600 605 

Thr Glu Thr His Phe Phe Thr Gin Ala Pro Ser Thr Val Pro Lys Ala 
610 615 620 

Thr Gin Thr Pro Ser Thr Glu Pro Glu Val Leu Thr Gin Ser Pro Ser 
625 630 635 640 

Thr Glu Pro Val Pro Phe Thr Arg Thr Leu Gly Ala Glu Pro Glu He 
645 650 655 

Thr Gin Thr Pro Ser Ala Ala Pro Glu Val Tyr Thr Arg Ser Ser Ser 
660 665 670 

Thr Met Pro Glu Thr Ala Gin Ser Thr Pro Leu Ala Ser Gin Asn Pro 
675 680 685 

Thr Ser Ser Gly Thr Gly Thr His Asn Thr Glu Pro Arg Thr Tyr Pro 
690 695 700 

Val Gin Thr Thr Pro His Thr Gin Lys Leu Tyr Thr Glu Asn Lys Thr 
705 710 715 720 



Leu Ser Phe Pro Thr Val Val Ser Glu Phe His Glu Met Ser Thr Ala 
725 730 735 

Glu Ser Gln Thr Pro Leu Leu Asp Val Lys Ile Val Glu Val Lys Phe
740 745 750 

Ser Asn Asp Gly Glu Val Thr Ala Thr Cys Val Ser Thr Val Lys Ser
755 760 765

Pro Tyr Arg Val Glu Thr Asn Trp Lys Val Asp Leu Val Asp Val Met 
770 775 780 

Asp Glu Ile Ser Gly Asn Ser Pro Ala Gly Val Phe Asn Ser Asn Glu
785 790 795 800 

Lys Trp Gin Lys Gin Leu Tyr Tyr Arg Val Thr Asp Gly Arg Thr Ser 
805 810 815 

Val Gin Leu Met Cys Leu Ser Cys Thr Ser His Ser Pro Glu Pro Tyr 
820 825 830 

Cys Leu Phe Asp Thr Ser Leu Ile Ala Arg Glu Lys Asp Ile Ala Pro
835 840 845 

Glu Leu Tyr Phe Thr Ser Asp Pro Gln Thr Ala Tyr Cys Thr Ile Thr
850 855 860 

Leu Pro Ser Gly Val Val Pro Arg Phe Glu Trp Ser Leu Asn Asn Val 
865 870 875 880 

Ser Leu Pro Glu Tyr Leu Thr Ala Thr Thr Val Val Ser His Thr Ala 
885 890 895 

Gly Gin Ser Thr Val Trp Lys Ser Ser Ala Arg Ala Gly Glu Ala Trp 
900 905 910 

Ile Ser Gly Arg Gly Gly Asn Ile Tyr Glu Cys Thr Val Leu Ile Ser
915 920 925 

Asp Gly Thr Arg Val Thr Thr Arg Lys Glu Arg Cys Leu Thr Asn Thr 
930 935 940 

Trp Ile Ala Val Glu Asn Gly Ala Ala Gln Ala Gln Leu Tyr Ser Leu
945 950 955 960 

Phe Ser Gly Leu Val Ser Gly Leu Cys Gly Ser Ile Ser Ala Leu Tyr
965 970 975 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met Ala Ser Leu Leu Gly Thr Leu Ala Leu Leu Ala Ala Thr Leu Ala 
1 5 10 15

Pro Phe Gly Ala Met Gly Ile Val Ile Thr Gly Asn His Val Ser Ala
20 25 30 

Arg He Asp Asp Asp His He Val He Val Ala Pro Arg Pro Glu Ala 
35 40 45 

Thr He Gin Leu Gin Leu Phe Phe Met Pro Gly Gin Arg Pro His Lys 
50 55 60 

Pro Tyr Ser Gly Thr Val Arg Val Ala Phe Arg Ser Asp He Thr Asn 
65 70 75 80 

Gin Cys Tyr Gin Glu Leu Ser Glu Glu Arg Phe Glu Asn Cys Thr His 
85 90 95 

Arg Ser Ser Ser Val Phe Val Gly Cys Lys Val Thr Glu Tyr Thr Phe 
100 105 110 

Ser Ala Ser Asn Arg Leu Thr Gly Pro Pro His Pro Phe Lys Leu Thr 
115 120 125 

Ile Arg Asn Pro Arg Pro Asn Asp Ser Gly Met Phe Tyr Val Ile Val
130 135 140

Arg Leu Asp Asp Thr Lys Glu Pro Ile Asp Val Phe Ala Ile Gln Leu
145 150 155 160

Ser Val Tyr Gln Phe Ala Asn Thr Ala Ala Thr Arg Gly Leu Tyr Ser
165 170 175

Lys Ala Ser Cys Arg Thr Phe Gly Leu Pro Thr Val Gln Leu Glu Ala
180 185 190

Tyr Leu Arg Thr Glu Glu Ser Trp Arg Asn Trp Gln Ala Tyr Val Ala
195 200 205

Thr Glu Ala Thr Thr Thr Ser Ala Glu Ala Thr Thr Pro Thr Pro Val
210 215 220

Thr Ala Thr Ser Ala Ser Glu Leu Glu Ala Glu His Phe Thr Phe Pro
225 230 235 240

Trp Leu Glu Asn Gly Val Asp His Tyr Glu Pro Thr Pro Ala Asn Glu 
245 250 255 

Asn Ser Asn Val Thr Val Arg Leu Gly Thr Met Ser Pro Thr Leu He 
260 265 270 

Gly Val Thr Val Ala Ala Val Val Ser Ala Thr He Gly Leu Val He 
275 280 285 

Val He Ser He Val Thr Arg Asn Met Cys Thr Pro His Arg Lys Leu 
290 295 300 

Asp Thr Val Ser Gin Asp Asp Glu Glu Arg Ser Gin Thr Arg Arg Glu 
305 310 315 320 

Ser Arg Lys Phe Gly Pro Met Val Ala Cys Glu He Asn Lys Gly Ala 
325 330 335 

Asp Gin Asp Ser Glu Leu Val Glu Leu Val Ala He Val Asn Pro Ser 
340 345 350 

Ala Leu Ser Ser Pro Asp Ser He Lys Met 
355 360 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asn Met Leu Val Ile Val Leu Ala Ser Cys Leu Ala Arg Leu Thr
1 5 10 15

Phe Ala Thr Arg His Val Leu Phe Leu Glu Gly Thr Gin Ala Val Leu 
20 25 30 

Gly Glu Asp Asp Pro Arg Asn Val Pro Glu Gly Thr Val He Lys Trp 
35 40 45 

Thr Lys Val Leu Arg Asn Ala Cys Lys Met Lys Ala Ala Asp Val Cys 
50 55 60 

Ser Ser Pro Asn Tyr Cys Phe His Asp Leu He Tyr Asp Gly Gly Lys 
65 70 75 80 

Lys Asp Cys Pro Pro Ala Gly Pro Leu Ser Ala Asn Leu Val He Leu 
85 90 95 

Leu Lys Arg Gly Glu Ser Phe Val Val Leu Gly Ser Gly Leu His Asn 
100 105 110 

Ser Asn He Thr Asn He Met Trp Thr Glu Tyr Gly Gly Leu Leu Phe 
115 120 125 

Asp Pro Val Thr Arg Ser Asp Glu Gly He Tyr Phe Arg Arg He Ser 
130 135 140 

Gin Pro Asp Leu Ala Met Glu Thr Thr Ser Tyr Asn Val Ser Val Leu 
145 150 155 160

Ser His Val Asp Glu Lys Ala Pro Ala Pro His Glu Val Glu He Asp 
165 170 175 

Thr He Lys Pro Ser Glu Ala His Ala His Val Glu Leu Gin Met Leu 
180 185 190 

Pro Phe His Glu Leu Asn Asp Asn Ser Pro Thr Tyr Val Thr Pro Val 
195 200 205 

Leu Arg Val Phe Pro Pro Thr Glu His Val Lys Phe Asn Val Thr Tyr 
210 215 220 

Ser Trp Tyr Gly Phe Asp Val Lys Glu Glu Cys Glu Glu Val Lys Leu 
225 230 235 240 

Phe Glu Pro Cys Val Tyr His Pro Thr Asp Gly Lys Cys Gin Phe Pro 
245 250 255 

Ala Thr Asn Gin Arg Cys Leu He Gly Ser Val Leu Met Ala Glu Phe 
260 265 270 

Leu Gly Ala Ala Ser Leu Leu Asp Cys Ser Arg Asp Thr Leu Glu Asp 
275 280 285 

Cys His Glu Asn Arg Val Pro Asn Leu Arg Phe Asp Ser Arg Leu Ser 
290 295 300 



Glu Ser Arg Ala Gly Leu Val He Ser Pro Leu He Ala He Pro Lys 
305 310 315 320

Val Leu He He Val Val Ser Asp Gly Asp He Leu Gly Trp Ser Tyr 
325 330 335 

Thr Val Leu Gly Lys Arg Asn Ser Pro Arg Val Val Val Glu Thr His 
340 345 350 

Met Pro Ser Lys Val Pro Met Asn Lys Val Val He Gly Ser Pro Gly 
355 360 365 

Pro Met Asp Glu Thr Gly Asn Tyr Lys Met Tyr Phe Val Val Ala Gly 
370 375 380 

Val Ala Ala Thr Cys Val He Leu Thr Cys Ala Leu Leu Val Gly Lys 
385 390 395 400 

Lys Lys Cys Pro Ala His Gin Met Gly Thr Phe Ser Lys Thr Glu Pro 
405 410 415 

Leu Tyr Ala Pro Leu Pro Lys Asn Glu Phe Glu Ala Gly Gly Leu Thr 
420 425 430

Asp Asp Glu Glu Val He Tyr Asp Glu Val Tyr Glu Pro Leu Phe Arg 
435 440 445 

Gly Tyr Cys Lys Gin Glu Phe Arg Glu Asp Val Asn Thr Phe Phe Gly 
450 455 460 

Ala Val Val Glu Gly Glu Arg Ala Leu Asn Phe Lys Ser Ala He Ala 
465 470 475 480

Ser Met Ala Asp Arg He Leu Ala Asn Lys Ser Gly Arg Arg Asn Met 
485 490 495 



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met Pro Phe Lys Thr Arg Gly Ala Glu Asp Ala Ala Ala Gly Lys Asn 
1 5 10 15

Arg Phe Lys Lys Ser Arg Asn Arg Glu He Leu Pro Thr Arg Leu Arg 
20 25 30

Gly Thr Gly Lys Lys Thr Ala Gly Leu Ser Asn Tyr Thr Gin Pro He 
35 40 45 

Pro Trp Asn Pro Lys Phe Cys Ser Ala Arg Gly Glu Ser Asp Asn His 
50 55 60

Ala Cys Lys Asp Thr Phe Tyr Arg Arg Thr Cys Cys Ala Ser Arg Ser 
65 70 75 80

Thr Val Ser Ser Gln Pro Asp Ser Pro His Thr Pro Met Pro Thr Glu
85 90 95 

Tyr Gly Arg Val Pro Ser Ala Lys Arg Lys Lys Leu Ser Ser Ser Asp 
100 105 110

Xaa Glu Gly Ala His Gin Pro Leu Val Ser Cys Lys Leu Pro Asp Ser 
115 120 125 

Gin Ala Ala Pro Ala Arg Thr Tyr Ser Ser Ala Gin Arg Tyr Thr Val 
130 135 140 

Asp Glu Val Ser Ser Pro Thr Pro Pro Gly Val Asp Ala Val Ala Asp 
145 150 155 160 

Leu Glu Thr Arg Ala Glu Leu Pro Gly Ala Thr Thr Glu Gin Thr Glu 
165 170 175 

Ser Lys Asn Lys Leu Pro Asn Gin Gin Ser Arg Leu Lys Pro Lys Pro 
180 185 190 

Thr Asn Glu His Val Gly Gly Glu Arg Cys Pro Ser Glu Gly Thr Val 
195 200 205 

Glu Ala Pro Ser Leu Gly Ile Leu Ser Arg Val Gly Ala Ala Ile Ala
210 215 220 

Asn Glu Leu Ala Arg Met Arg Arg Ala Cys Leu Pro Leu Ala Ala Ser 
225 230 235 240 

Ala Ala Ala Ala Gly He Val Ala Trp Ala Ala Ala Arg Ala Leu Gin 
245 250 255 

Lys Gin Gly Arg 
260 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1305 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1305



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATG CAC CGT CCT CAT CTC AGA CGG CAC TCG CGT TAC TAC GCG AAA GGA 
Met His Arg Pro His Leu Arg Arg His Ser Arg Tyr Tyr Ala Lys Gly 
1 5 10 15



GAG GTG CTT AAC AAA CAC ATG GAT TGC GGT GGA AAA CGG TGC TGC TCA 
Glu Val Leu Asn Lys His Met Asp Cys Gly Gly Lys Arg Cys Cys Ser 
20 25 30 



GGC GCA GCT GTA TTC ACT CTT TTC TGG ACT TGT GTC AGG ATT ATG CGG 
Gly Ala Ala Val Phe Thr Leu Phe Trp Thr Cys Val Arg He Met Arg 
35 40 45 

GAG CAT ATC TGC TTT GTA CGC AAC GCT ATG GAC CGC CAT TTA TTT TTG 
Glu His He Cys Phe Val Arg Asn Ala Met Asp Arg His Leu Phe Leu 
50 55 60 

AGG AAT GCT TTT TGG ACT ATC GTA CTG CTT TCT TCC TTC GCT AGC CAG 
Arg Asn Ala Phe Trp Thr He Val Leu Leu Ser Ser Phe Ala Ser Gin 
65 70 75 80 

AGC ACC GCC GCC GTC ACG TAC GAC TAC ATT TTA GGC CGT CGC GCG CTC 
Ser Thr Ala Ala Val Thr Tyr Asp Tyr He Leu Gly Arg Arg Ala Leu 
85 90 95 

GAC GCG CTA ACC ATA CCG GCG GTT GGC CCG TAT AAC AGA TAC CTC ACT 
Asp Ala Leu Thr He Pro Ala Val Gly Pro Tyr Asn Arg Tyr Leu Thr 
100 105 110 

AGG GTA TCA AGA GGC TGC GAC GTT GTC GAG CTC AAC CCG ATT TCT AAC 
Arg Val Ser Arg Gly Cys Asp Val Val Glu Leu Asn Pro He Ser Asn 
115 120 125 

GTG GAC GAC ATG ATA TCG GCG GCC AAA GAA AAA GAG AAG GGG GGC CCT 
Val Asp Asp Met He Ser Ala Ala Lys Glu Lys Glu Lys Gly Gly Pro 
130 135 140 

TTC GAG GCC TCC GTC GTC TGG TTC TAC GTG ATT AAG GGC GAC GAC GGC 
Phe Glu Ala Ser Val Val Trp Phe Tyr Val He Lys Gly Asp Asp Gly 
145 150 155 160 

GAG GAC AAG TAC TGT CCA ATC TAT AGA AAA GAG TAC AGG GAA TGT GGC 
Glu Asp Lys Tyr Cys Pro He Tyr Arg Lys Glu Tyr Arg Glu Cys Gly 
165 170 175 

GAC GTA CAA CTG CTA TCT GAA TGC GCC GTT CAA TCT GCA CAG ATG TGG 
Asp Val Gin Leu Leu Ser Glu Cys Ala Val Gin Ser Ala Gin Met Trp 
180 185 190 

GCA GTG GAC TAT GTT CCT AGC ACC CTT GTA TCG CGA AAT GGC GCG GGA 
Ala Val Asp Tyr Val Pro Ser Thr Leu Val Ser Arg Asn Gly Ala Gly 
195 200 205 

CTG ACT ATA TTC TCC CCC ACT GCT GCG CTC TCT GGC CAA TAC TTG CTG 
Leu Thr He Phe Ser Pro Thr Ala Ala Leu Ser Gly Gin Tyr Leu Leu 
210 215 220 

ACC CTG AAA ATC GGG AGA TTT GCG CAA ACA GCT CTC GTA ACT CTA GAA 
Thr Leu Lys He Gly Arg Phe Ala Gin Thr Ala Leu Val Thr Leu Glu 
225 230 235 240 

GTT AAC GAT CGC TGT TTA AAG ATC GGG TCG CAG CTT AAC TTT TTA CCG 
Val Asn Asp Arg Cys Leu Lys He Gly Ser Gin Leu Asn Phe Leu Pro 
245 250 255 

TCG AAA TGC TGG ACA ACA GAA CAG TAT CAG ACT GGA TTT CAA GGC GAA 
Ser Lys Cys Trp Thr Thr Glu Gin Tyr Gin Thr Gly Phe Gin Gly Glu 
260 265 270 

CAC CTT TAT CCG ATC GCA GAC ACC AAT ACA CGA CAC GCG GAC GAC GTA 
His Leu Tyr Pro He Ala Asp Thr Asn Thr Arg His Ala Asp Asp Val 
275 280 285 

TAT CGG GGA TAC GAA GAT ATT CTG CAG CGC TGG AAT AAT TTG CTG AGG 
Tyr Arg Gly Tyr Glu Asp He Leu Gin Arg Trp Asn Asn Leu Leu Arg 
290 295 300 



AAA AAG AAT CCT AGC GCG CCA GAC CCT CGT CCA GAT AGC GTC CCG CAA 
Lys Lys Asn Pro Ser Ala Pro Asp Pro Arg Pro Asp Ser Val Pro Gin 
305 310 315 320 

GAA ATT CCC GCT GTA ACC AAG AAA GCG GAA GGG CGC ACC CCG GAC GCA 
Glu He Pro Ala Val Thr Lys Lys Ala Glu Gly Arg Thr Pro Asp Ala 
325 330 335 

GAA AGC AGC GAA AAG AAG GCC CCT CCA GAA GAC TCG GAG GAC GAC ATG 
Glu Ser Ser Glu Lys Lys Ala Pro Pro Glu Asp Ser Glu Asp Asp Met 
340 345 350 

CAG GCA GAG GCT TCT GGA GAA AAT CCT GCC GCC CTC CCC GAA GAC GAC 
Gin Ala Glu Ala Ser Gly Glu Asn Pro Ala Ala Leu Pro Glu Asp Asp 
355 360 365 

GAA GTC CCC GAG GAC ACC GAG CAC GAT GAT CCA AAC TCG GAT CCT GAC 
Glu Val Pro Glu Asp Thr Glu His Asp Asp Pro Asn Ser Asp Pro Asp 
370 375 380 

TAT TAC AAT GAC ATG CCC GCC GTG ATC CCG GTG GAG GAG ACT ACT AAA 
Tyr Tyr Asn Asp Met Pro Ala Val He Pro Val Glu Glu Thr Thr Lys 
385 390 395 400 

AGT TCT AAT GCC GTC TCC ATG CCC ATA TTC GCG GCG TTC GTA GCC TGC 
Ser Ser Asn Ala Val Ser Met Pro He Phe Ala Ala Phe Val Ala Cys 
405 410 415 

GCG GTC GCG CTC GTG GGG CTA CTG GTT TGG AGC ATC GTA AAA TGC GCG 
Ala Val Ala Leu Val Gly Leu Leu Val Trp Ser He Val Lys Cys Ala 
420 425 430 

CGT AGC TAA 
Arg Ser 

435 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met His Arg Pro His Leu Arg Arg His Ser Arg Tyr Tyr Ala Lys Gly 
1 5 10 15

Glu Val Leu Asn Lys His Met Asp Cys Gly Gly Lys Arg Cys Cys Ser 
20 25 30 

Gly Ala Ala Val Phe Thr Leu Phe Trp Thr Cys Val Arg He Met Arg 
35 40 45 

Glu His He Cys Phe Val Arg Asn Ala Met Asp Arg His Leu Phe Leu 
50 55 60 

Arg Asn Ala Phe Trp Thr Ile Val Leu Leu Ser Ser Phe Ala Ser Gln
65 70 75 80 

Ser Thr Ala Ala Val Thr Tyr Asp Tyr Ile Leu Gly Arg Arg Ala Leu
85 90 95

Asp Ala Leu Thr Ile Pro Ala Val Gly Pro Tyr Asn Arg Tyr Leu Thr
100 105 110 

Arg Val Ser Arg Gly Cys Asp Val Val Glu Leu Asn Pro lie Ser Asn 
115 120 125 

Val Asp Asp Met lie Ser Ala Ala Lys Glu Lys Glu Lys Gly Gly Pro 
130 135 140 

Phe Glu Ala Ser Val Val Trp Phe Tyr Val lie Lys Gly Asp Asp Gly 
145 150 155 160 

Glu Asp Lys Tyr Cys Pro lie Tyr Arg Lys Glu Tyr Arg Glu Cys Gly 
165 170 175 

Asp Val Gin Leu Leu Ser Glu Cys Ala Val Gin Ser Ala Gin Met Trp 
180 185 190 

Ala Val Asp Tyr Val Pro Ser Thr Leu Val Ser Arg Asn Gly Ala Gly 
195 200 205 

Leu Thr lie Phe Ser Pro Thr Ala Ala Leu Ser Gly Gin Tyr Leu Leu 
210 215 220 

Thr Leu Lys lie Gly Arg Phe Ala Gin Thr Ala Leu Val Thr Leu Glu 
225 230 235 240 

Val Asn Asp Arg Cys Leu Lys lie Gly Ser Gin Leu Asn Phe Leu Pro 
245 250 255 

Ser Lys Cys Trp Thr Thr Glu Gin Tyr Gin Thr Gly Phe Gin Gly Glu 
260 265 270 

His Leu Tyr Pro lie Ala Asp Thr Asn Thr Arg His Ala Asp Asp Val 
275 280 285 

Tyr Arg Gly Tyr Glu Asp lie Leu Gin Arg Trp Asn Asn Leu Leu Arg 
290 295 300 

Lys Lys Asn Pro Ser Ala Pro Asp Pro Arg Pro Asp Ser Val Pro Gin 
305 310 315 320 

Glu lie Pro Ala Val Thr Lys Lys Ala Glu Gly Arg Thr Pro Asp Ala 
325 330 335 

Glu Ser Ser Glu Lys Lys Ala Pro Pro Glu Asp Ser Glu Asp Asp Met 
340 345 350 

Gin Ala Glu Ala Ser Gly Glu Asn Pro Ala Ala Leu Pro Glu Asp Asp 
355 360 365 

Glu Val Pro Glu Asp Thr Glu His Asp Asp Pro Asn Ser Asp Pro Asp 
370 375 380 

Tyr Tyr Asn Asp Met Pro Ala Val lie Pro Val Glu Glu Thr Thr Lys 
385 390 395 400 

Ser Ser Asn Ala Val Ser Met Pro lie Phe Ala Ala Phe Val Ala Cys 
405 410 415 

Ala Val Ala Leu Val Gly Leu Leu Val Trp Ser He Val Lys Cys Ala 
420 425 430 

Arg Ser



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..689 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ATG GCG CCT GTA AAA GTG ACT ATA GTT TCT GCG GTC GAT TCG CAC TAC 
Met Ala Pro Val Lys Val Thr Ile Val Ser Ala Val Asp Ser His Tyr
1 5 10 15

AAA CTA CCT AAT TCT AGA TTT GAG CTC TCG GAT TCT GGA TGG AAA GAA 
Lys Leu Pro Asn Ser Arg Phe Glu Leu Ser Asp Ser Gly Trp Lys Glu 
20 25 30 

TTG GTT CAC GCA GTG AAA ACT ATG GCG AGT TAC GAT CGT CCG AGT ACA 
Leu Val His Ala Val Lys Thr Met Ala Ser Tyr Asp Arg Pro Ser Thr 
35 40 45 

TTA TCG GTA ATC GTG CGC CCG GCA TCT CTG TAC GAA GTT TCC GGG GAG 
Leu Ser Val He Val Arg Pro Ala Ser Leu Tyr Glu Val Ser Gly Glu 
50 55 60 

CTG TTT TCC CTT CCC AGG ATG TGC AGA CCC GTG ATT CGG TTC GGT GAG 
Leu Phe Ser Leu Pro Arg Met Cys Arg Pro Val He Arg Phe Gly Glu 
65 70 75 80 

GGG GGC GAC CCG CCT GGA GTA AGT CCC GAG TGG AGC GGC TTG GAC GCA 
Gly Gly Asp Pro Pro Gly Val Ser Pro Glu Trp Ser Gly Leu Asp Ala
85 90 95

GGG TTT TAC CAT TTG TCA TCT GGC GCG TAT GCC GCA AAA GAG TTC CAT 
Gly Phe Tyr His Leu Ser Ser Gly Ala Tyr Ala Ala Lys Glu Phe His 
100 105 110

TTG TGG GTG CTG GGT ACC GCT GAC ATA TGC ATG GCA GCT TTA AAC CTC 
Leu Trp Val Leu Gly Thr Ala Asp He Cys Met Ala Ala Leu Asn Leu 
115 120 125 

CCT GCG CCA AAA ACT TTC CTA ATT ACC GAA ACC GGA GGT AAA AAT TTT 
Pro Ala Pro Lys Thr Phe Leu He Thr Glu Thr Gly Gly Lys Asn Phe 
130 135 140 

GAG AGA GGA GTG GAA ATA TTT TTG GTA AAC GGA GAC AAG ACA ACG CTG 
Glu Arg Gly Val Glu He Phe Leu Val Asn Gly Asp Lys Thr Thr Leu 
145 150 155 160 

TCT CTG AGT CAC CCA TCA GTC TGG ACA ACT CTT GCC CCT TCG AGC CTG 
Ser Leu Ser His Pro Ser Val Trp Thr Thr Leu Ala Pro Ser Ser Leu 
165 170 175 



AGA ACG CCC TGG CCG TAC AGC ACG GTA AAG TTT TTA AAA GTA AAA CCT 
Arg Thr Pro Trp Pro Tyr Ser Thr Val Lys Phe Leu Lys Val Lys Pro 
180 185 190 

AAC TCG GCC GCA TAC TGT GTT TCC GAC TCG GAT GAT GGC GAA CGG CAG 
Asn Ser Ala Ala Tyr Cys Val Ser Asp Ser Asp Asp Gly Glu Arg Gin 
195 200 205 

CCA AAA TTT TTT CTC GGG AGT CTA TTT AAG TCG AAG AAA CCC CGC TCC 
Pro Lys Phe Phe Leu Gly Ser Leu Phe Lys Ser Lys Lys Pro Arg Ser 
210 215 220 

CCG CGG CGC CGA CGT TA G 
Pro Arg Arg Arg Arg 
225 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ala Pro Val Lys Val Thr Ile Val Ser Ala Val Asp Ser His Tyr
1 5 10 15

Lys Leu Pro Asn Ser Arg Phe Glu Leu Ser Asp Ser Gly Trp Lys Glu 
20 25 30 

Leu Val His Ala Val Lys Thr Met Ala Ser Tyr Asp Arg Pro Ser Thr 
35 40 45 

Leu Ser Val Ile Val Arg Pro Ala Ser Leu Tyr Glu Val Ser Gly Glu
50 55 60 

Leu Phe Ser Leu Pro Arg Met Cys Arg Pro Val Ile Arg Phe Gly Glu
65 70 75 80 

Gly Gly Asp Pro Pro Gly Val Ser Pro Glu Trp Ser Gly Leu Asp Ala 
85 90 95 

Gly Phe Tyr His Leu Ser Ser Gly Ala Tyr Ala Ala Lys Glu Phe His 
100 105 110 

Leu Trp Val Leu Gly Thr Ala Asp Ile Cys Met Ala Ala Leu Asn Leu
115 120 125 

Pro Ala Pro Lys Thr Phe Leu Ile Thr Glu Thr Gly Gly Lys Asn Phe
130 135 140 

Glu Arg Gly Val Glu Ile Phe Leu Val Asn Gly Asp Lys Thr Thr Leu
145 150 155 160 

Ser Leu Ser His Pro Ser Val Trp Thr Thr Leu Ala Pro Ser Ser Leu 
165 170 175 

Arg Thr Pro Trp Pro Tyr Ser Thr Val Lys Phe Leu Lys Val Lys Pro 
180 185 190 

Asn Ser Ala Ala Tyr Cys Val Ser Asp Ser Asp Asp Gly Glu Arg Gln
195 200 205 
Pro Lys Phe Phe Leu Gly Ser Leu Phe Lys Ser Lys Lys Pro Arg Ser
210 215 220 

Pro Arg Arg Arg Arg 
225 

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..380



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

ATG CGT AGC TCA GTT ACG TCA TTG TGG AGC CCT TCA GAT CAC GCC TCT 
Met Arg Ser Ser Val Thr Ser Leu Trp Ser Pro Ser Asp His Ala Ser 
1 5 10 15

TCG CCC GCA AAT GCC AAG CAT TTT TAT CAT ATT TCC GAT TTC CGG CGC 
Ser Pro Ala Asn Ala Lys His Phe Tyr His Ile Ser Asp Phe Arg Arg
20 25 30 

GCG GAA ACG GCG CCT GCG GGC GGT ACG GGC GCG CGA ACT GAG GTT AAG 
Ala Glu Thr Ala Pro Ala Gly Gly Thr Gly Ala Arg Thr Glu Val Lys 
35 40 45 

CGT CGC GCT TTC ACT TTC CCA GCG GCA GCG GTA CTC AGC GCA ACT GAA 
Arg Arg Ala Phe Thr Phe Pro Ala Ala Ala Val Leu Ser Ala Thr Glu 
50 55 60 

GCC CGA ACC GGC TCG TCT ATC ACC GGC TTA AAC CGT ACT CCG TCT GCA 
Ala Arg Thr Gly Ser Ser Ile Thr Gly Leu Asn Arg Thr Pro Ser Ala
65 70 75 80 

ATA ATT TCC CTT GCA TGG TCC GAA ATG AGA AAT CTT AAG GAC CCC CTC 
Ile Ile Ser Leu Ala Trp Ser Glu Met Arg Asn Leu Lys Asp Pro Leu
85 90 95 

GGG TCC CTG TCG CTG GAA ATA GCT TTA ACG AAT GTC TCT AAC TTT TCC 
Gly Ser Leu Ser Leu Glu Ile Ala Leu Thr Asn Val Ser Asn Phe Ser
100 105 110 

CTC TTG AGC TCA GAC CCC ATG GCC TTC GAA AAG TCT TCA TAT TG 
Leu Leu Ser Ser Asp Pro Met Ala Phe Glu Lys Ser Ser Tyr 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Arg Ser Ser Val Thr Ser Leu Trp Ser Pro Ser Asp His Ala Ser 
1 5 10 15

Ser Pro Ala Asn Ala Lys His Phe Tyr His Ile Ser Asp Phe Arg Arg
20 25 30

Ala Glu Thr Ala Pro Ala Gly Gly Thr Gly Ala Arg Thr Glu Val Lys 
35 40 45 

Arg Arg Ala Phe Thr Phe Pro Ala Ala Ala Val Leu Ser Ala Thr Glu 
50 55 60 

Ala Arg Thr Gly Ser Ser Ile Thr Gly Leu Asn Arg Thr Pro Ser Ala
65 70 75 80

Ile Ile Ser Leu Ala Trp Ser Glu Met Arg Asn Leu Lys Asp Pro Leu
85 90 95 

Gly Ser Leu Ser Leu Glu Ile Ala Leu Thr Asn Val Ser Asn Phe Ser
100 105 110 

Leu Leu Ser Ser Asp Pro Met Ala Phe Glu Lys Ser Ser Tyr 
115 120 125

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 879 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..878



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

ATG TGG TGT CGT TTG CAC TGG ATA AGT CCT CGG TTC AGT ATT ATG CGT 
Met Trp Cys Arg Leu His Trp Ile Ser Pro Arg Phe Ser Ile Met Arg
1 5 10 15

CCC GGT TCC CGA ACT GGT AGG GTT TTG CGA GGC CAG GGG TGT GCT CTG 
Pro Gly Ser Arg Thr Gly Arg Val Leu Arg Gly Gln Gly Cys Ala Leu
20 25 30 

TGC AGT TTC TGG CAT CGT ACT CGA ACT CCG AGT ATA AAC CTC CGG TGC 
Cys Ser Phe Trp His Arg Thr Arg Thr Pro Ser Ile Asn Leu Arg Cys
35 40 45

CGC GCT CGG GGT CTG AGT AAT TTC CGG CTC TGC GCC CAG AGT CCG GGT 
Arg Ala Arg Gly Leu Ser Asn Phe Arg Leu Cys Ala Gln Ser Pro Gly
50 55 60 

GAA AGG CAC AGG TTC GGT ACT CGG ACT CTG AGT CAA CAC CTC CGG CTC 
Glu Arg His Arg Phe Gly Thr Arg Thr Leu Ser Gln His Leu Arg Leu
65 70 75 80 

TGT ACT CGG AGT CTG AGT AGC TTT CGG TAC CGT ACT CGG GGC CTG AGT 
Cys Thr Arg Ser Leu Ser Ser Phe Arg Tyr Arg Thr Arg Gly Leu Ser 
85 90 95 

GAA AAA GTG TGT TTC AGT ACT CTG AGT TCG CAT AGT GTC CGG CTC GGC 
Glu Lys Val Cys Phe Ser Thr Leu Ser Ser His Ser Val Arg Leu Gly 
100 105 110 

ACT CGA AGT CTG AGT AAA GGC CTC AGT TCC CGC GCT CTG AGT CCG AGT 
Thr Arg Ser Leu Ser Lys Gly Leu Ser Ser Arg Ala Leu Ser Pro Ser 
115 120 125 

AAA AAT CGC CGG TTC AGT ACT CGA ACT CAG AGT AGT TTT CGG TAC CGT 
Lys Asn Arg Arg Phe Ser Thr Arg Thr Gln Ser Ser Phe Arg Tyr Arg
130 135 140 

GCT CGG GGT CTG AGT AAA CAC CTC CGT TAC CGT ACT CGA ACT CTG TGT 
Ala Arg Gly Leu Ser Lys His Leu Arg Tyr Arg Thr Arg Thr Leu Cys 
145 150 155 160 

AAA AAC CTC CGG CGC CGC GCT CGG AGC GCG AGC GGT TTC GGG GGG CGT 
Lys Asn Leu Arg Arg Arg Ala Arg Ser Ala Ser Gly Phe Gly Gly Arg 
165 170 175 

GCT ACG AGA CTG AGT AAA TAT CTC GGG TAT CGT GCT CGG GGT CTG GGC 
Ala Thr Arg Leu Ser Lys Tyr Leu Gly Tyr Arg Ala Arg Gly Leu Gly 
180 185 190 

AGG TGC CTC GGT TTC TGC ACC CGG AGT CTG AGT AAA AGT CAT CTG TTC 
Arg Cys Leu Gly Phe Cys Thr Arg Ser Leu Ser Lys Ser His Leu Phe 
195 200 205 

AGC ACT CGG AGT CTG AGT AAA CAA CGC CTC CGT TTC TGC GAT CTG CGT 
Ser Thr Arg Ser Leu Ser Lys Gln Arg Leu Arg Phe Cys Asp Leu Arg
210 215 220 

CTG AGT AAG AGC CGC CTG TTC AGT ACT CGG AGT CTG AGT AAA ATA CCA 
Leu Ser Lys Ser Arg Leu Phe Ser Thr Arg Ser Leu Ser Lys Ile Pro
225 230 235 240 

CGG TTC CTG ACT CTG GGA CCG CGC GGT TTC CGA CTC GGT ACT CGG ACT 
Arg Phe Leu Thr Leu Gly Pro Arg Gly Phe Arg Leu Gly Thr Arg Thr 
245 250 255 

CTG AGT AAA GAC CAC CGT TTC TGC ACT CTG GGT CTG TGT AGT TTC ATG 
Leu Ser Lys Asp His Arg Phe Cys Thr Leu Gly Leu Cys Ser Phe Met 
260 265 270 

TGC CGC GCT CGG GGT CTC GGT AGA AAT CCC CGG CGC GGT CGT AGG AAA 
Cys Arg Ala Arg Gly Leu Gly Arg Asn Pro Arg Arg Gly Arg Arg Lys 
275 280 285 

CAG TGT ATT TTC TG A 
Gln Cys Ile Phe
290 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Met Trp Cys Arg Leu His Trp Ile Ser Pro Arg Phe Ser Ile Met Arg
1 5 10 15

Pro Gly Ser Arg Thr Gly Arg Val Leu Arg Gly Gln Gly Cys Ala Leu
20 25 30 

Cys Ser Phe Trp His Arg Thr Arg Thr Pro Ser Ile Asn Leu Arg Cys
35 40 45 

Arg Ala Arg Gly Leu Ser Asn Phe Arg Leu Cys Ala Gln Ser Pro Gly
50 55 60 

Glu Arg His Arg Phe Gly Thr Arg Thr Leu Ser Gln His Leu Arg Leu
65 70 75 80 

Cys Thr Arg Ser Leu Ser Ser Phe Arg Tyr Arg Thr Arg Gly Leu Ser 
85 90 95 

Glu Lys Val Cys Phe Ser Thr Leu Ser Ser His Ser Val Arg Leu Gly 
100 105 110 

Thr Arg Ser Leu Ser Lys Gly Leu Ser Ser Arg Ala Leu Ser Pro Ser 
115 120 125 

Lys Asn Arg Arg Phe Ser Thr Arg Thr Gln Ser Ser Phe Arg Tyr Arg
130 135 140 

Ala Arg Gly Leu Ser Lys His Leu Arg Tyr Arg Thr Arg Thr Leu Cys 
145 150 155 160 

Lys Asn Leu Arg Arg Arg Ala Arg Ser Ala Ser Gly Phe Gly Gly Arg 
165 170 175 

Ala Thr Arg Leu Ser Lys Tyr Leu Gly Tyr Arg Ala Arg Gly Leu Gly 
180 185 190 

Arg Cys Leu Gly Phe Cys Thr Arg Ser Leu Ser Lys Ser His Leu Phe 
195 200 205 

Ser Thr Arg Ser Leu Ser Lys Gln Arg Leu Arg Phe Cys Asp Leu Arg
210 215 220 

Leu Ser Lys Ser Arg Leu Phe Ser Thr Arg Ser Leu Ser Lys Ile Pro
225 230 235 240 

Arg Phe Leu Thr Leu Gly Pro Arg Gly Phe Arg Leu Gly Thr Arg Thr 
245 250 255 

Leu Ser Lys Asp His Arg Phe Cys Thr Leu Gly Leu Cys Ser Phe Met 
260 265 270 

Cys Arg Ala Arg Gly Leu Gly Arg Asn Pro Arg Arg Gly Arg Arg Lys 
275 280 285 

Gln Cys Ile Phe
290 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..533



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

ATG CTC CCA AGC CTA CTC AAC AGG GGC TCT CCC CGG CTG AAT TCT CCT 
Met Leu Pro Ser Leu Leu Asn Arg Gly Ser Pro Arg Leu Asn Ser Pro 
1 5 10 15

CCT AAG TGT TCA GAG GCC TCT GCT GTA CCA TAT AAC TAT CGT GTA GTA 
Pro Lys Cys Ser Glu Ala Ser Ala Val Pro Tyr Asn Tyr Arg Val Val 
20 25 30 

CGC CCC TCC CAG TCC GTG TCC GAT ACT GCC CCT TTT GAG AGG ATT GGG 
Arg Pro Ser Gln Ser Val Ser Asp Thr Ala Pro Phe Glu Arg Ile Gly
35 40 45 

AGA TTA GAG AAT CGA AAT GAT TGG AGA GCC ACA TTC AGA CTT AAT CAC 
Arg Leu Glu Asn Arg Asn Asp Trp Arg Ala Thr Phe Arg Leu Asn His 
50 55 60 

ATT TTT ATT GAG TCG GGC GAG CTT AGC GCA GAC GGG TTA ACA ATC GCA 
Ile Phe Ile Glu Ser Gly Glu Leu Ser Ala Asp Gly Leu Thr Ile Ala
65 70 75 80 

ACC AGT TCC ACA AGT TCA CTA TCC TGG TCA GCG CCC TTG TTT ATT TCG 
Thr Ser Ser Thr Ser Ser Leu Ser Trp Ser Ala Pro Leu Phe Ile Ser
85 90 95 

CAC GCA ACC ATG GGT CCA AAT TTT CGC GAT TCC CTT CTA GTT TGG GAA 
His Ala Thr Met Gly Pro Asn Phe Arg Asp Ser Leu Leu Val Trp Glu 
100 105 110

CGT TCT TCG TCG TCT TGC GAG ACC GTG TCT AAT TTT CGG TGC GGG GTG 
Arg Ser Ser Ser Ser Cys Glu Thr Val Ser Asn Phe Arg Cys Gly Val 
115 120 125 

CAC ATG TTT CTG GTG ACG ATG GAA ATT ACA ATG ACG AGG CCG ATC GTT 
His Met Phe Leu Val Thr Met Glu Ile Thr Met Thr Arg Pro Ile Val
130 135 140 

GCG CTC ACG ACG GCA GCC ACG GTT ACC CCA ATT AGC GTA GGG CTC ATT 
Ala Leu Thr Thr Ala Ala Thr Val Thr Pro Ile Ser Val Gly Leu Ile
145 150 155 160 

GTC CCG AGA CGG ACA GTA ACG TTT GAA TTT TCG TTT GCG GGT GTC GGT 
Val Pro Arg Arg Thr Val Thr Phe Glu Phe Ser Phe Ala Gly Val Gly 
165 170 175 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu Pro Ser Leu Leu Asn Arg Gly Ser Pro Arg Leu Asn Ser Pro 
1 5 10 15

Pro Lys Cys Ser Glu Ala Ser Ala Val Pro Tyr Asn Tyr Arg Val Val 
20 25 30 

Arg Pro Ser Gln Ser Val Ser Asp Thr Ala Pro Phe Glu Arg Ile Gly
35 40 45 

Arg Leu Glu Asn Arg Asn Asp Trp Arg Ala Thr Phe Arg Leu Asn His 
50 55 60 

Ile Phe Ile Glu Ser Gly Glu Leu Ser Ala Asp Gly Leu Thr Ile Ala
65 70 75 80 

Thr Ser Ser Thr Ser Ser Leu Ser Trp Ser Ala Pro Leu Phe Ile Ser
85 90 95 

His Ala Thr Met Gly Pro Asn Phe Arg Asp Ser Leu Leu Val Trp Glu 
100 105 110 

Arg Ser Ser Ser Ser Cys Glu Thr Val Ser Asn Phe Arg Cys Gly Val 
115 120 125 

His Met Phe Leu Val Thr Met Glu Ile Thr Met Thr Arg Pro Ile Val
130 135 140 

Ala Leu Thr Thr Ala Ala Thr Val Thr Pro Ile Ser Val Gly Leu Ile
145 150 155 160 

Val Pro Arg Arg Thr Val Thr Phe Glu Phe Ser Phe Ala Gly Val Gly 
165 170 175 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GAATTCGAGC TCGGTACCCG GATAATACGT ACATGTTAAC GCAGAGGT 
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCTGACCGCT AGTCGACCTG CAGTGAATAA TAAAAT 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
TGTCCGTCGA GATCCTCTAG AGTCGACGAA AGGTCAGAGA CGATGCCC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CGGATCAGAA ACTCTTTCGG TACCCGGGAT CCTCTAGA 38 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GAATACAAGC TTAGATGCAT ATTTACTCGA GCC 33

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GGTTTGGCGG AGCGGATATG ATCTCGACCT GCAGTGAATA ATAAAATGTG T 51 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
TGTCCGTCGA GATCCTCTAG AGTCGAGATC AGCAAAATGT TCACGGGG 48 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:
AAGCTTGGCG TAATCATG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GGAATTCGAG CTCGGTACCT CGTGGCGAGC GCAGGCGGC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GGCCGAGTTA GGTTTTACTT TTCTAGAGGA TCCCCTCGAC GTCTGGGGCG C 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
TTGCTGCGTT CCCGGGGATC CTCTAGAATT AGGTAGTTTG TAGTGCGA 
(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TCAAGATCCA GGAAATCCTT CGGTACCGAG CTCGAATTCG TA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 
GAATTCGAGC TCGGTACCGA AAGCTACTCA GAC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CGCAAACAGC TCTCGTAACT CTAGAAGTTA ACGATCGCTG TT 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GAATAGCATA CCAATGCCTA TTCATTGGGA CTCGACTCTA GAGGATCCCC GGGAACG 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
TCGAGGGGAT CCTCTAGAGT CGAGGGACCC ATGGTTGCGT GC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
TTTACTAAAG CGCGGCGAAA GCTTCGTCGT GCTGGGTTCT GG 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
AAGCTTGGCG TAATCATGGT C 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GGAATTCGAG CTCGGTACCC GGATAATACG TACATGTTAA CGCAGAGG 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ATCTATTGGA GCGTTTAGCG CGCGTCGACG AAAGGTCAGA GACGA 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
CTGCTTCATT TCTGATCCCC GGGAACG 
(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
ACCACCCCCG CGCCCCAGAC GTCGAGGGGA TCAATTATTG CGTATTGAAT A 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
ATCAGAAACT CTTTCGGTAC CGAGCTCGAA TTC 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GAATTCGAGC TCGGTACCCG GATAATACGT ACATGTTAAC GCAGAGGT 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:
GCTGACCGCT AGTCGACTCT AGAGGATCCC CTC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
CGTTCCCGGG GATCCTCTAG AGTCGACGGC AGAGTCGCAG AC 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 
TGATCCAAAC TCGGATCCTC TAGAGTCGAC 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
AAGCTTGGGC TGCAGGTCGA CTCTAGAGGA TCCCCTCGAC GTCTGGGG
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CACACCTTTG CGCATCTCCA CAGCTCAACA ATGAATTCCA TGTTACGTCC TGTAGAAACC 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CAGGGAGGCA AACAATGAAT CAACAACTCT CCCGGGAGAT GGGGGAGGCT AACTGAAACA 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
TGCTGCGTTC CCGGGGATCC TCTAGAGTCG ACCTGCAGCC CAAGC 
(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TCTAGAGTCG ACCTGCAGTG AATAATAAAA TGTGTGTTTG TCCGAAAT 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:
CTCCATAGAA GACACCGGGA CCATGGATCC CGTCGTTTTA CAACG 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
TCGGCGGAAA TCCAGCTGAG CGCCGGTCGC TACCATTACC AGTTGGTCTG GTGTCAAAAA 
GATCTAGAAT AAGCTAGAGG ATCGATCCCC TATGGCGATC ATCAG 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CCGTCGAGAT CCTCTAGAGT CGACCTGCAG GTCGAC 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CCTAGCACCC TTGTATCGCG 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:
CGCCTCGAGT CCCAATGAAT AGGCATTGG 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 



(iv) ANTI-SENSE: N



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:



CGCCTCGAGG ACCCATGGTT GCGTGCG 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:
CTCGTCCGAA CGAGTTACAG 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18912 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: N 
(iv) ANTI-SENSE: N

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 697..1533
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: complement(1900..2784)
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: complement(2916..3605)
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3694..5124
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 5210..7081
(D) OTHER INFORMATION: 

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 7245..8123
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8333..11290
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 11098..12402
(D) OTHER INFORMATION: 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12510..13598
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 13792..15291
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 15298..16080
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 16129..17013
(D) OTHER INFORMATION: 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: complement(17380..18216)
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GGATCCCGAA GAGCTCTCCC AGAAGTTTTT CTTTTCGGAC GTATCGGAGG ACGAAGAACC 60

GGCACGCGGG AGGAGCTGGA GCGACCCGGA GTCGGAGGAA GAGCAGCCTG GGTGCCGGGG 120 

ACTGGACTTG GGCGAGGAGG ACACGGGACA CAGCTCCACC GAGTCAGAGC CCACGCAATC 180

TGACTTAGAC TTTATTGACG ACAGCTCTCC GGCGCCGCCG CCATTTGCTA TCCCCCGCGT 240 

CCGTGCGTTA TTGCGGTGCG CGGCACCCGC AAAGACCCAC GGAAGGCTTC GGCCGCCAGG 300

GCGGGTAGGC GCACTCTTAA AAGACGGAGG TTGTCATTTT CTTCTTCCTC TGACGAGGAA 360 

TCCGAGGAGA GAAGTAAAAA AGAAGAAGCG GCCTCGACCC CTGCACGGCG ACGCAAGGCC 420 

GAGGCCTCGA CGAGCAGATA GAGGAGACGC GGGGCAGAAC CTCCCCCTCC CTCCCACCCC 480 

CCTACTCTGG ACATTTATTG CCCGCTCGAT CCATTCTCAT CCAGAACTTC TTTCCCGCTC 540 

AGCCTTCACG CAGAAGCGGA CGCGCGCCCC TTTGCGACCG CCGGACATCC CGCCGCCCCC 600 

CCCCCTTCAC GCCCGGCGCA ATCCGTAGCC GTCCAACTCG GCCCAGCACA ACCGCAGTAG 660 

ACCGCCCGGA CCGCTCTCCT CTAGACACAT CCCTAA ATG GAA AAC ATG CTC GAC 714 

Met Glu Asn Met Leu Asp 
1 5 

GGG TGC TAC CCG CTG GCG CTG ATG GAC AGC GAT CAC ATT ACT GCG CAC 762 
Gly Cys Tyr Pro Leu Ala Leu Met Asp Ser Asp His Ile Thr Ala His
10 15 20

GCG GTA CCT CGT GGC GAG CGC AGG CGG CAA GGT GCC GCT GTC GCC TCG 810 
Ala Val Pro Arg Gly Glu Arg Arg Arg Gln Gly Ala Ala Val Ala Ser
25 30 35 

TCG GAG TCG GCC GAC TCG GTA GAC CCG TGC ATT CGG ATC GCC TCG CGG 858 
Ser Glu Ser Ala Asp Ser Val Asp Pro Cys Ile Arg Ile Ala Ser Arg
40 45 50 

CTC TGG CGC GAG TTA GTC GAG ATA TCG TCC GAA CTC AAG GAC GGT TAC 906 
Leu Trp Arg Glu Leu Val Glu Ile Ser Ser Glu Leu Lys Asp Gly Tyr
55 60 65 70

GGA GAG TTC ACG TCA GCG AGA GAC CGC CGC AAC GCG CTG ATT GCT GCC 954 
Gly Glu Phe Thr Ser Ala Arg Asp Arg Arg Asn Ala Leu Ile Ala Ala
75 80 85 

AAC GAA CGG CTA CGT TCG GCT TTT CTG GGG GCC AGC CGG GCG ACG CGC 1002
Asn Glu Arg Leu Arg Ser Ala Phe Leu Gly Ala Ser Arg Ala Thr Arg 
90 95 100 

GGC CTA GGT TTG AGG CCG CGG TGG GCG TCG ACG GAG AGC GTC GCC AAC 1050

Gly Leu Gly Leu Arg Pro Arg Trp Ala Ser Thr Glu Ser Val Ala Asn 
105 110 115 

TCC CCC ACT GAC CCG AAT AAC GGC AAC GGG TTG GGA GAA TTA GAG GAG 1098

Ser Pro Thr Asp Pro Asn Asn Gly Asn Gly Leu Gly Glu Leu Glu Glu 
120 125 130 

GCA ATG GAA GGG ATC GAG GGC GAT TTC TGG CTC GAC TCT CTG GAC GGT 1146 
Ala Met Glu Gly Ile Glu Gly Asp Phe Trp Leu Asp Ser Leu Asp Gly
135 140 145 150

gap CGC TTC GAG GAC GAG AGC CGT ACC ATG CAG AGC GAG AAT ATG CGT 1194 
Asp Arg Phe Glu Asp Glu Ser Arg Thr Met Gln Ser Glu Asn Met Arg
155 160 165

TTC GTG ATC GAG AAA GAA CTG TTA TCC TGG CTG TCC CGA CAC CTG CCG 1242 
Phe Val Ile Glu Lys Glu Leu Leu Ser Trp Leu Ser Arg His Leu Pro
170 175 180 

GCC GAC CTC GCG TCC GCC GAG CGA GAG ACC TCC CGG TCT CTC CTG GCG 1290 
Ala Asp Leu Ala Ser Ala Glu Arg Glu Thr Ser Arg Ser Leu Leu Ala 
185 190 195

GCC GGG CAC TGG TGC TGC TTG TGG CAC CCT CGG CCG TGC CGC GAA GCG 1338

Ala Gly His Trp Cys Cys Leu Trp His Pro Arg Pro Cys Arg Glu Ala 
200 205 210 

TGT TTG TAC GAC TCG ATT TAC GTG CAG AGT CTT TTC TGC GTC GGG ACG 1386

Cys Leu Tyr Asp Ser Ile Tyr Val Gln Ser Leu Phe Cys Val Gly Thr
215 220 225 230 

GGG AGA GTC CCG CAA TCG GAG ATG CGC CGT CGC GAA TAC CTG GCC GCC 1434 
Gly Arg Val Pro Gln Ser Glu Met Arg Arg Arg Glu Tyr Leu Ala Ala
235 240 245

TTG CGC GCC GGC GCG GCT GCC GCC AAC TCT CCC GAA GTG AGC GCC TCG 1482 
Leu Arg Ala Gly Ala Ala Ala Ala Asn Ser Pro Glu Val Ser Ala Ser 
250 255 260 

ATC TTT GCG AGG GAC GCT GGA ATC GCG CTG GCG CTG GCG CGG CGC CGT 1530 
Ile Phe Ala Arg Asp Ala Gly Ile Ala Leu Ala Leu Ala Arg Arg Arg
265 270 275 

TGA CGGGAGAATG ACGCCCTCTA GCGGCTTCCT TACCTCCGCG TCCCTGACAA 1583 

CCTCGCGGGT TTTTACACTG TCCTCCGTCC ACTCTCCCCC CTCACCCACT CCGCGGCAGC 1643 

GAAACACAAC CCCCCCCCCC CCCCAGAAAC GAGCGACACG CGAGCGCTGC GAAATAAATA 1703 

AAGTAATATT ATTGTGTGTT TTTCACGTTG TTGCAATCGA GAGGCCGTTT GTCTGTCTGT 1763 

GTCTGTGCGG AGCTAGGCTT TCCCGGGCGG CCCCGTTCCA CCGTTCGGTT AGGCCGGTGG 1823 

CGACGGGACA TAGAGAAAGA TAGAGCGCGC GCCCTGGCGG CGAGAGGGTG TTGCGGGGGT 1883 

AAATGGGACC CTGAGCTCAC CATTTTGGCG GGGGATTGCA CGGGTAACAA AAAGCTCTCT 1943 
CGCACATAAT GATTTCCCTT AAACAGTGGC TGTAAAAGCT TTCTTCGACT GGGACGCGCA 2003

CGTCCGGAGA CATGATCTTA TCGGTAGCTA CACAGTTCAT GAGGTGGGCC ACGAACGCGC 2063

GGATCGAGTT TTGGGAACCT TCGGGGAGGT CTTCCGGGAG GGTGAAGTTT GACAGAGGCA 2123

GCGCTATCAC CAGGAGGCTC CGCACCATCT CCATGCCTAT CCTTATCGCC GCGAGTCCGG 2183 

CGGCCGGCGC GCTGCTCTGG TTATTCCAGT GCGCGGACCG CGAGTGCGCC CCTCCCCGGG 2243

CTCTGATATA GAGCACCGGC AGCTCGACGG CGGCGGAGAA AAAAGAAAGA ATGTCCGGCC 2303

CAATGACTGG AACTTTGGGC ACGTCTCTTA TTTCCCACGC GGCGGCCCGG GGAATCTGCT 2363

TGCCCCAGAC CTTGCTTTCC AACTCCCCGT TCGGCCCCCC AACTAACTCC GACAGCGCGG 2423 

TCCACAGTCC TACCGCCGCT GCGACGGCGC GCTTAGCCGC GGGCGCTATT CGCGGGTCGT 2483 

GCGCCGTGAT ATCTTCGGCG ACCTGCAGAC TGCCCAGCCT TTCCTTCCCT TCAAAATACG 2543 

CGCGGGCGGC CTGTACGATC ACCGCGGCCA GATCGGGCCA AAAGAAAATA TCGCAACTCT 2603 

GCGACGCCCG CCAGAATCTC CCTCCGGGCA GGTCCGTGCC CCTAAAGGCC GCCGAGAAAG 2663 

CTAAGTCCAA ATGTGACGTC GGAGGTCTCG ACATGGTCGC CAACCCTCCA AATGCTACCC 2723

GCCGGCCCAC GCAACGCGGG CTTTTATAAA GATGGCGCGC GAGACAATAA CACTTACTCA 2783

TCCGCGTACG CGTTTATTAT TGTCAATATT TGTGTGGTTA TTATTACTGC TACCGCCCTT 2843 

GTTTCTGCAA GGCCCTCGCC GCGGCCCAGG CCACTATTCC GGCAGCGGCC GCCGACGCGG 2903 

CGAGCGTCGC CGCTAACGTC GGCGCCGCGG GGAGCGGGGT TTCTTCGACT TAAATAGACT 2963 

CCCGAGAAAA AATTTTGGCT GCCGTTCGCC ATCATCCGAG TCGGAAACAC AGTATGCGGC 3023 

CGAGTTAGGT TTTACTTTTA AAAACTTTAC CGTGCTGTAC GGCCAGGGCG TTCTCAGGCT 3083

CGAAGGGGCA AGAGTTGTCC AGACTGATGG GTGACTCAGA GACAGCGTTG TCTTGTCTCC 3143 

GTTTACCAAA AATATTTCCA CTCCTCTCTC AAAATTTTTA CCTCCGGTTT CGGTAATTAG 3203 

GAAAGTTTTT GGCGCAGGGA GGTTTAAAGC TGCCATGCAT ATGTCAGCGG TACCCAGCAC 3263 

CCACAAATGG AACTCTTTTG CGGCATACGC GCCAGATGAC AAATGGTAAA ACCCTGCGTC 3323 

CAAGCCGCTC CACTCGGGAC TTACTCCAGG CGGGTCGCCC CCCTCACCGA ACCGAATCAC 3383

GGGTCTGCAC ATCCTGGGAA GGGAAAACAG CTCCCCGGAA ACTTCGTACA GAGATGCCGG 3443 

GCGCACGATT ACCGATAATG TACTCGGACG ATCGTAACTC GCCATAGTTT TCACTGCGTG 3503

AACCAATTCT TTCCATCCAG AATCCGAGAG CTCAAATCTA GAATTAGGTA GTTTGTAGTG 3563

CGAATCGACC GCAGAAACTA TAGTCACTTT TACAGGCGCC ATCGCCGCTCAG 3615 

ACTCCACCCC GCTATGATGT CAGAAATATA ACGCTCTTAT TCTAGCAGAG TCAGGCCAAT 3675 

ATATACAGCT TAGAGAAG ATG CGG TTT CGG CGC ATC TGT TCA CGC TCT AGG 3726 
Met Arg Phe Arg Arg Ile Cys Ser Arg Ser Arg
1 5 10

GCA GAA AAA CGA AGA AGA ACA ACC GAG AAT CCG CTT ACC TCA AAA CGC 3774

Ala Glu Lys Arg Arg Arg Thr Thr Glu Asn Pro Leu Thr Ser Lys Arg 
15 20 25 



GTT TGC GTA TTG GAT AGT TTC TCA CGG ACA ATG TCA TTG CGC CCC TA. 
Val Cys Val Leu Asp Ser Phe Ser Arg Thr Met Ser Leu Arg Pro Tyr
30 35 40 

GCA GAA ATT TTG CCG ACC GCG GAA GGC GTC GAG CGC CTC GCC GAA CTT 
Ala Glu Ile Leu Pro Thr Ala Glu Gly Val Glu Arg Leu Ala Glu Leu
45 50 55 

GTT AGT GTG ACA ATG ACA GAA CGC GCG GAA CCT GTG ACA GAG AAT ACA 
Val Ser Val Thr Met Thr Glu Arg Ala Glu Pro Val Thr Glu Asn Thr 
60 65 70 75 

GCT GTA AAC AGT ATC CCC CCG GCT AAC GAG AAC GGG CAG AAC TTC GCA 
Ala Val Asn Ser Ile Pro Pro Ala Asn Glu Asn Gly Gln Asn Phe Ala
80 85 90 

TAT GCA GGC GAT GGG CCC TCG ACT ACT GAA AAA GTT GAC GGC TCG CAT 
Tyr Ala Gly Asp Gly Pro Ser Thr Thr Glu Lys Val Asp Gly Ser His
95 100 105

ACA GAC TTC GAT GAA GCA TCG AGC GAC TAC GCC GGC CCT GTC CCG CTC 
Thr Asp Phe Asp Glu Ala Ser Ser Asp Tyr Ala Gly Pro Val Pro Leu 
110 115 120

GCG CAA ACT AGA TTG AAG CAT TCG GAT GAA TTT CTT CAG CAC TTC CGA 
Ala Gln Thr Arg Leu Lys His Ser Asp Glu Phe Leu Gln His Phe Arg
125 130 135

GTT TTA GAC GAT TTG GTG GAG GGG GCT TAC GGG TTT ATC TGC GAC GTC 
Val Leu Asp Asp Leu Val Glu Gly Ala Tyr Gly Phe Ile Cys Asp Val
140 145 150 155 

CGT CGC TAC ACC GAG GAA GAG CAA CGT CGA AGA GGG GTT AAC AGT ACT 
Arg Arg Tyr Thr Glu Glu Glu Gln Arg Arg Arg Gly Val Asn Ser Thr
160 165 170 

AAC CAG GGG AAA TCA AAA TGT AAG CGC CTG ATA GCT AAA TAT GTG AAA 
Asn Gln Gly Lys Ser Lys Cys Lys Arg Leu Ile Ala Lys Tyr Val Lys
175 180 185 

AAT GGA ACA AGG GCG GCC TCT CAG CTG GAA AAT GAA ATT TTG GTT CTC 
Asn Gly Thr Arg Ala Ala Ser Gln Leu Glu Asn Glu Ile Leu Val Leu
190 195 200 

GGG CGC CTA AAT CAC GAG AAT GTT CTC AAG ATC CAG GAA ATC CTT CGG 
Gly Arg Leu Asn His Glu Asn Val Leu Lys Ile Gln Glu Ile Leu Arg
205 210 215 

TAC CCG GAT AAT ACG TAC ATG TTA ACG CAG AGG TAT CAG TTC GAC TTG 
Tyr Pro Asp Asn Thr Tyr Met Leu Thr Gln Arg Tyr Gln Phe Asp Leu
220 225 230 235 

TAC AGC TAC ATG TAC GAT GAA GCG TTC GAC TGG AAA GAC AGT CCA ATG 
Tyr Ser Tyr Met Tyr Asp Glu Ala Phe Asp Trp Lys Asp Ser Pro Met
240 245 250 

CTT AAA CAG ACT AGA CGC ATC ATG AAG CAG CTC ATG TCA GCG GTC TCG 
Leu Lys Gln Thr Arg Arg Ile Met Lys Gln Leu Met Ser Ala Val Ser
255 260 265 

TAT ATC CAT TCA AAG AAA CTG ATT CAC AGG GAC ATC AAA CTC GAA AAT 
Tyr Ile His Ser Lys Lys Leu Ile His Arg Asp Ile Lys Leu Glu Asn
270 275 280 

ATT TTC TTA AAC TGC GAC GGC AAG ACA GTG CTG GGC GAC TTT GGA ACT 
Ile Phe Leu Asn Cys Asp Gly Lys Thr Val Leu Gly Asp Phe Gly Thr
285 290 295 



GTC ACG CCT TTT GAA AAT GAG CGG GAG CCC TTC GAA TAT GGA TGG GTG 
Val Thr Pro Phe Glu Asn Glu Arg Glu Pro Phe Glu Tyr Gly Trp Val
300 305 310 315 

GGG ACC GTG GCT ACT AAC TCT CCC GAG ATA CTC GCC AGG GAT TCG TAC 
Gly Thr Val Ala Thr Asn Ser Pro Glu Ile Leu Ala Arg Asp Ser Tyr
320 325 330 

TGT GAA ATT ACA GAC ATT TGG AGC TGC GGA GTA GTA TTG CTG GAA ATG 
Cys Glu Ile Thr Asp Ile Trp Ser Cys Gly Val Val Leu Leu Glu Met
335 340 345

GTA AC- CAT GAA TTT TGC CCG ATC GGC GAT GGC GGG GGA AAT CCG CAC 
Val Ser His Glu Phe Cys Pro Ile Gly Asp Gly Gly Gly Asn Pro His
350 355 360 

CAG CAA TTG CTG AAA GTT ATC GAC TCT CTC TCA GTT TGT GAT GAA GAG 
Gln Gln Leu Leu Lys Val Ile Asp Ser Leu Ser Val Cys Asp Glu Glu
365 370 375 

TTC CCA GAC CCC CCG TGT AAT CTG TAC AAT TAT TTG CAT TAT GCG AGC 
Phe Pro Asp Pro Pro Cys Asn Leu Tyr Asn Tyr Leu His Tyr Ala Ser 
380 385 390 395 

ATC GAT CGC GCC GGA CAT ACG GTC CCG TCG CTC ATA CGG AAC CTC CAC 
Ile Asp Arg Ala Gly His Thr Val Pro Ser Leu Ile Arg Asn Leu His
400 405 410 

CTT CCG GCG GAT GTG GAA TAC CCT CTA GTT AAA ATG CTT ACT TTT GAC 
Leu Pro Ala Asp Val Glu Tyr Pro Leu Val Lys Met Leu Thr Phe Asp 
415 420 425 

TGG CGT TTG AGA CCC AGC GCG GCC GAA GTA TTG GCA ATG CCA CTG TTT 
Trp Arg Leu Arg Pro Ser Ala Ala Glu Val Leu Ala Met Pro Leu Phe 
430 435 440 

TCG GCT GAA GAG GAA CGG ACC ATA ACA ATT ATT CAT GGA AAA CAT AAA 
Ser Ala Glu Glu Glu Arg Thr Ile Thr Ile Ile His Gly Lys His Lys
445 450 455 

CCC ATC CGA CCC GAA ATC CGT GCG CGG GTG CCA CGG TCC ATG AGT GAA 
Pro Ile Arg Pro Glu Ile Arg Ala Arg Val Pro Arg Ser Met Ser Glu
460 465 470 475 

GGT TAA TAATAAAGGA CGGAGATAGA GAACTGAAGC GTCAGATTTT TTTAAAAAAA 
Gly . 

TAAATGATCG AGAACTTATG ATTTGTCTTT CTTGA ATG ACC TTG CCC CAT CGA 

Met Thr Leu Pro His Arg 
1 5 

TTA ACG AAA AGA CCT TTC GCG CGT CGA TTC TGC TCG GTC TTT GTG ATA 
Leu Thr Lys Arg Pro Phe Ala Arg Arg Phe Cys Ser Val Phe Val Ile
10 15 20 

CAT TAT AGT GAG ACT AAA CTC GAC CGA TAT AAC AAG ACA ATG TTA CTC 
His Tyr Ser Glu Thr Lys Leu Asp Arg Tyr Asn Lys Thr Met Leu Leu 
25 30 35 

TAT AGA CCG GAC TCA ACC ATG CGG CAT AGC GGA GGC GAC GCA AAT CAC 
Tyr Arg Pro Asp Ser Thr Met Arg His Ser Gly Gly Asp Ala Asn His 
40 45 50 

AGA GGG ATA AGG CCG AGG CGG AAA TCT ATT GGA GCG TTT AGC GCG CGC 
Arg Gly Ile Arg Pro Arg Arg Lys Ser Ile Gly Ala Phe Ser Ala Arg
55 60 65 70 



GAA AAG 
ACT GGA AAA CGA AAT GCG CTG ACG GAA AGC AGC TCC TCC TCC
Thr Gly Lys Arg Asn Ala Leu Thr Glu Ser Ser Ser Ser Ser 



GAC ATG CTA GAT CCG TTT TCC ACG GAT AAG GAA TTT GGC GGT AAG TGG 
Asp Met Leu Asp Pro Phe Ser Thr Asp Lys Glu Phe Gly Gly Lys Trp

ACG GTA GAC GGA CCT GCC GAC ATT ACT GCC GAG GTC CTT TCT CAG GCA 
Thr Val Asp Gly Pro Ala Asp Ile Thr Ala Glu Val Leu Ser Gln Ala
105 110 115

TGG GAC GTT CTC CAA TTA GTG AAG CAT GAA GAT GCG GAG GAG GAG AGA 
Trp Asp Val Leu Gln Leu Val Lys His Glu Asp Ala Glu Glu Glu Arg



GTG ACT TAT GAG TCC AAA CCG ACC CCG ATA CAG CCG TTC AAT GCC TGG 
Val Thr Tyr Glu Ser Lys Pro Thr Pro Ile Gln Pro Phe Asn Ala Trp
135 140 145 150 

CCG GAC GGG CCG ACT TGG AAC GCG CAG GAT TTT ACT CGA GCG CCA ATA 
Pro Asp Gly Pro Ser Trp Asn Ala Gln Asp Phe Thr Arg Ala Pro Ile
155 160 165

GTT TAT CCC TCT GCG GAG GTA TTG GAC GCA GAG GCG TTG AAA GTA GGG 
Val Tyr Pro Ser Ala Glu Val Leu Asp Ala Glu Ala Leu Lys Val Gly
170 175 180

GCA TTC GTT AGC CGA GTT TTA CAA TGT GTA CCG TTC ACG CGA TCA AAG 
Ala Phe Val Ser Arg Val Leu Gln Cys Val Pro Phe Thr Arg Ser Lys
185 190 195 

AAA AGC GTT ACG GTG CGG GAT GCG CAG TCG TTT TTG GGG GAC TCG TTC 
Lys Ser Val Thr Val Arg Asp Ala Gln Ser Phe Leu Gly Asp Ser Phe
200 205 210 

TGG AGA ATA ATG CAG AAC GTT TAC ACG GTT GTC TTA CGA CAG CAC ATA 
Trp Arg Ile Met Gln Asn Val Tyr Thr Val Val Leu Arg Gln His Ile
215 220 225 230 

ACT CGA CTC AGG CAC CCT TCC AGC AAA AGC ATT GTT AAC TGC AAC GAC 
Thr Arg Leu Arg His Pro Ser Ser Lys Ser Ile Val Asn Cys Asn Asp
235 240 245 

CCT CTA TGG TAC GCC TAC GCG AAT CAA TTT CAC TGG AGA GGA ATG CGC 
Pro Leu Trp Tyr Ala Tyr Ala Asn Gln Phe His Trp Arg Gly Met Arg
250 255 260 

GTG CCG TCG CTT AAA TTA GCC TCT CCC CCG GAG GAG AAT ATT CAA CAC 
Val Pro Ser Leu Lys Leu Ala Ser Pro Pro Glu Glu Asn Ile Gln His
265 270 275

GGC CCA ATG GCC GCC GTT TTT AGA AAC GCG GGG GCT GGT CTG TTC CTG 
Gly Pro Met Ala Ala Val Phe Arg Asn Ala Gly Ala Gly Leu Phe Leu 
280 285 290 

TGG CCT GCC ATG CGC GCA GCC TTT GAA GAG CGC GAC AAG CGA CTG TTA 
Trp Pro Ala Met Arg Ala Ala Phe Glu Glu Arg Asp Lys Arg Leu Leu 
295 300 305 310 

AGA GCA TGC CTG TCT TCA CTC GAT ATC ATG GAC GCA GCC GTC CTC GCG 
Arg Ala Cys Leu Ser Ser Leu Asp Ile Met Asp Ala Ala Val Leu Ala
315 320 325

TCG TTT CCA TTT TAC TGG CGC GGC GTC CAA GAC ACC TCG CGC TTC GAG 
Ser Phe Pro Phe Tyr Trp Arg Gly Val Gln Asp Thr Ser Arg Phe Glu
330 335 340 



CCT GCG CTG GGC TGT TTG TCA GAG TAC TTT GCA CTA GTG GTG TTA CTG 
Pro Ala Leu Gly Cys Leu Ser Glu Tyr Phe Ala Leu Val Val Leu Leu
345 350 355 

GCC GAG ACG GTC TTA GCG ACC ATG TTC GAC CAC GCA CTG GTA TTC ATG 
Ala Glu Thr Val Leu Ala Thr Met Phe Asp His Ala Leu Val Phe Met 
360 365 370 

AGG GCG CTG GCA GAC GGC AAT TTC GAT GAC TAT GAC GAA ACT AGA TAT 
Arg Ala Leu Ala Asp Gly Asn Phe Asp Asp Tyr Asp Glu Thr Arg Tyr
375 380 385 390 

ATA GAC CCC GTT AAA AAC GAG TAC CTG AAC GGA GCC GAG GGT ACT CTG 
Ile Asp Pro Val Lys Asn Glu Tyr Leu Asn Gly Ala Glu Gly Thr Leu
395 400 405

TTA CGG GGC ATA GTG GCC TCC AAC ACC GCT CTG GCG GTG GTT TGC GCA 
Leu Arg Gly Ile Val Ala Ser Asn Thr Ala Leu Ala Val Val Cys Ala
410 415 420 

AAC ACC TAT TCG ACG ATA AGA AAA CTC CCG TCC GTG GCA ACT AGC GCG 
Asn Thr Tyr Ser Thr Ile Arg Lys Leu Pro Ser Val Ala Thr Ser Ala
425 430 435 

TGC AAT GTT GCC TAC AGG ACC GAA ACG CTG AAA GCG AGG CGC CCT GGC 
Cys Asn Val Ala Tyr Arg Thr Glu Thr Leu Lys Ala Arg Arg Pro Gly 
440 445 450

ATG AGC GAC ATA TAC CGG ATA TTA CAA AAA GAG TTT TTC TTT TAC ATT 
Met Ser Asp Ile Tyr Arg Ile Leu Gln Lys Glu Phe Phe Phe Tyr Ile
455 460 465 470

GCG TGG CTC CAG AGG GTT GCA ACA CAC GCA AAT TTC TGT TTA AAC ATT 
Ala Trp Leu Gln Arg Val Ala Thr His Ala Asn Phe Cys Leu Asn Ile
475 480 485 

CTG AAG AGA AGC GTG GAT ACG GGG GCC CCG CCA TTT TTG TTC AGG GCC 
Leu Lys Arg Ser Val Asp Thr Gly Ala Pro Pro Phe Leu Phe Arg Ala
490 495 500 

AGC TCG GAG AAG CGG CTG CAG CAG TTA AAT AAA ATG CTC TGC CCC CTT 
Ser Ser Glu Lys Arg Leu Gln Gln Leu Asn Lys Met Leu Cys Pro Leu
505 510 515 

CTC GTG CCG ATT CAA TAT GAA GAC TTT TCG AAG GCC ATG GGG TCT GAG 
Leu Val Pro Ile Gln Tyr Glu Asp Phe Ser Lys Ala Met Gly Ser Glu
520 525 530 

CTC AAG AGG GAA AAG TTA GAG ACA TTC GTT AAA GCT ATT TCC AGC GAC 
Leu Lys Arg Glu Lys Leu Glu Thr Phe Val Lys Ala Ile Ser Ser Asp
535 540 545 550 

AGG GAC CCG AGG GGG TCC TTA AGA TTT CTC ATT TCG GAC CAT GCA AGG 
Arg Asp Pro Arg Gly Ser Leu Arg Phe Leu Ile Ser Asp His Ala Arg
555 560 565 

GAA ATT ATT GCA GAC GGA GTA CGG TTT AAG CCG GTG ATA GAC GAG CCG 
Glu Ile Ile Ala Asp Gly Val Arg Phe Lys Pro Val Ile Asp Glu Pro
570 575 580 

GTT CGG GCT TCA GTT GCG CTG AGT ACC GCT GCC GCT GGG AAA GTG AAA 
Val Arg Ala Ser Val Ala Leu Ser Thr Ala Ala Ala Gly Lys Val Lys 
585 590 595 

GCG CGA CGC TTA ACC TCA GTT CGC GCG CCC GTA CCG GGC GCA GGC GCC 
Ala Arg Arg Leu Thr Ser Val Arg Ala Pro Val Pro Gly Ala Gly Ala 
600 605 610 



GTT TCC GCG CGC CGG AAA TCG GAA ATA TGA TAAAAATGCT TGGCATTTGC 
Val Ser Ala Arg Arg Lys Ser Glu Ile
615 620 

GGGCGAAGAG GCGTGATCTG AAGGGCTCCA CAATGACGTA ACTGAGCTAC GCATCCCTAT 

AAAGTGTACC CGCTGACCGC TAGCCCATAC AGTGTTACAG GAGGGGAGAG AGACAACTTC

AGCTCGAAGT CTGAAGAGAC ATC ATG AGC GGC TTC AGT AAC ATA GGA TCG 
Met Ser Gly Phe Ser Asn Ile Gly Ser
1 5 

ATT GCC ACC GTT TCC CTA GTA TGC TCG CTT TTG TGC GCA TCT GTA TTA 
Ile Ala Thr Val Ser Leu Val Cys Ser Leu Leu Cys Ala Ser Val Leu
10 15 20 25 

GGG GCG CCG GTA CTG GAC GGG CTC GAG TCG AGC CCT TTC CCG TTC GGG 
Gly Ala Pro Val Leu Asp Gly Leu Glu Ser Ser Pro Phe Pro Phe Gly 
30 35 40 

GGC AAA ATT ATA GCC CAG GCG TGC AAC CGC ACC ACG ATT GAG GTG ACG 
Gly Lys Ile Ile Ala Gln Ala Cys Asn Arg Thr Thr Ile Glu Val Thr
45 50 55 

GTC CCG TGG AGC GAC TAC TCT GGT CGC ACC GAA GGA GTG TCA GTC GAG 
Val Pro Trp Ser Asp Tyr Ser Gly Arg Thr Glu Gly Val Ser Val Glu 
60 65 70 

GTG AAA TGG TTC TAC GGG AAT AGT AAT CCC GAA AGC TTC GTG TTC GGG 
Val Lys Trp Phe Tyr Gly Asn Ser Asn Pro Glu Ser Phe Val Phe Gly 
75 80 85 

GTG GAT AGC GAA ACG GGC AGT GGA CAC GAG GAC CTG TCT ACG TGC TGG 
Val Asp Ser Glu Thr Gly Ser Gly His Glu Asp Leu Ser Thr Cys Trp 
90 95 100 105 

GCT CTA ATC CAT AAT CTG AAC GCG TCT GTG TGC AGG GCG TCT GAC GCC 
Ala Leu Ile His Asn Leu Asn Ala Ser Val Cys Arg Ala Ser Asp Ala
110 115 120

GGG ATA CCT GAT TTC GAC AAG CAG TGC GAA AAA GTG CAG AGA AGA CTG 
Gly Ile Pro Asp Phe Asp Lys Gln Cys Glu Lys Val Gln Arg Arg Leu
125 130 135 

CGC TCC GGG GTG GAA CTT GGT AGT TAC GTG TCT GGC AAT GGA TCC CTG 
Arg Ser Gly Val Glu Leu Gly Ser Tyr Val Ser Gly Asn Gly Ser Leu 
140 145 150 

GTG CTG TAC CCA GGG ATG TAC GAT GCC GGC ATC TAC GCC TAC CAG CTC 
Val Leu Tyr Pro Gly Met Tyr Asp Ala Gly Ile Tyr Ala Tyr Gln Leu
155 160 165 

TCA GTG GGT GGG AAG GGA TAT ACC GGG TCT GTT TAT CTA GAC GTC GGA 
Ser Val Gly Gly Lys Gly Tyr Thr Gly Ser Val Tyr Leu Asp Val Gly 
170 175 180 185 

CCA AAC CCC GGA TGC CAC GAC CAG TAT GGG TAC ACC TAT TAC AGC CTG 
Pro Asn Pro Gly Cys His Asp Gln Tyr Gly Tyr Thr Tyr Tyr Ser Leu
190 195 200 

GCC GAC GAG GCG TCA GAC TTA TCA TCT TAT GAC GTA GCC TCG CCC GAA 
Ala Asp Glu Ala Ser Asp Leu Ser Ser Tyr Asp Val Ala Ser Pro Glu 
205 210 215 

CTC GAC GGT CCT ATG GAG GAA GAT TAT TCC AAT TGT CTA GAC ATG CCC 
Leu Asp Gly Pro Met Glu Glu Asp Tyr Ser Asn Cys Leu Asp Met Pro 
220 225 230 
CCG CTA CGC CCA TGG ACA ACC GTT TGT TCG CAT GAC GTC GAG GAG CAG 7991
Pro Leu Arg Pro Trp Thr Thr Val Cys Ser His Asp Val Glu Glu Gln
235 240 245 

GAA AAC GCC ACG GAC GAG CTT TAC CTA TGG GAC GAG GAA TGC GCC GGT 8039
Glu Asn Ala Thr Asp Glu Leu Tyr Leu Trp Asp Glu Glu Cys Ala Gly 
250 255 260 265 

r-TG GAC GAG TAC GTC GAC GAA AGG TCA GAG ACG ATG CCC AGG ATG 8087
Pro Leu Asp Glu Tyr Val Asp Glu Arg Ser Glu Thr Met Pro Arg Met 
270 275 280 

GTT GTC TTT TCA CCG CCC TCT ACG CTC CAG CAG TAG CCACCCGAGA 8133
Val Val Phe Ser Pro Pro Ser Thr Leu Gln Gln .
285 290 

GTGTTTTTTG TGAGCGCCCA CGCAACATAC CTAACTGCTT CATTTCTGAT CAATTATTGC 8193 

GTATTGAATA AATAAACAGT ACAAAAGCAT CAGGTGTGGT TTGCGTGTCT GTGCTAAACC 8253

ATGGCGTGTG CGGGTGAAAC CGTAAATTAC GTGATAATAA ATAGCATAGG AGTTGGCGTG 8313 

CAGCGTATTT CGCCGAGAGA TGGGGACAAT GTTAGTGTTG CGCCTTTTCC TACTTGCAGT 8373 

AGCGGACGCG GCGTTGCCGA CCGGCAGATT CTGCCGAGTT TGGAAGGTGC CTCCGGGAGG 8433 

AACCATCCAA GAGAACCTGG CGGTGCTCGC GGAATCGCCG GTCACGGGAC ACGCGACATA 8493

TCCGCCGCCT GAAGGCGCCG TCAGCTTTCA GATTTTTGCG GACACCCCTA CTTTGCGCAT 8553 

TCGCTACGGC GCTACGGAGG ACGAACTTGC ACTGGAGCGC GGGACGTCCG CCTCAGACGC 8613 

GGACAACGTG ACATTTTCGC TGTCATATCG CCCGCGCCCA GAAATTCACG GAGCATACTT 8673

CACCATAGGG GTATTCGCTA CTGGCCAGAG CACGGAAAGC AGCTATTCGG TCATCAGTCG 8733

GGTCTTAGTT AACGCCTCTC TGGAACGGTC CGTGCGCCTG GAAACGCCGT GCGATGAAAA 8793 

TTTTTTGCAG AACGAGCCTA CATGGGGCTC GAAGCGTTGG TTAGGCCCCC CGTCGCCTTA 8853 

TGTGCGAGAT AACGATGTCG CCGTGTTGAC AAAAGCGCAG TACATTGGGG AGTGCTACTC 8913 

CAACTCGGCG GCCCAGACGG GGCTCACGTC TCTCAACATG ACCTTTTTCT ATTCGCCTAA 8973 

AAGAATAGTA AACGTCACGT GGACAACCGG CGGCCCCTCC CCCTCGCGCA TAACGGTATA 9033 

CTCGTCGCGG GAGAACGGGC AGCCCGTGTT GAGGAACGTT TCTGACGGGT TCTTGGTTAA 9093 

GTACACTCCC GACATTGACG GCCGGGCCAT GATAAACGTT ATTGCCAATT ATTCGCCGGC 9153 

GGACTCCGGC AGCGTCCTCG CGTTTACGGC CTTTAGGGAA GGAAAACTCC CATCCGCGAT 9213 

TCAACTGCAC CGGATAGATA TGTCCGGGAC TGAGCCGCCG GGGACTGAAA CGACCTTCGA 9273 

CTGTCAAAAA ATGATAGAAA CCCCGTACCG AGCGCTCGGG AGCAATGTTC CCAGGGACGA 9333 

CTCTATCCGT CCGGGGGCCA CTCTGCCTCC GTTCGATACC GCAGCACCTG ATTTCGATAC 9393 

AGGTACTTCC CCGACCCCCA CTACCGTGCC AGAGCCAGCC ATTACTACAC TCATACCGCG 9453 

CAGCACTAGC GATATGGGAT TCTTCTCCAC GGCACGTGCT ACCGGATCAG AAACTCTTTC 9513 

GGTACCCGTC CAGGAAACGG ATAGAACTCT TTCGACAACT CCTCTTACCC TTCCACTGAC 9573 

TCCCGGTGAG TCAGAAAATA CACTGTTTCC TACGACCGCG CCGGGGATTT CTACCGAGAC 9633 

CCCGAGCGCG GCACATGAAA CTACACAGAC CCAGAGTGCA GAAACGGTGG TCTTTACTCA 9693 



GAGTCCGAGT ACCGAGTCGG AAACCGCGCG GTCCCAGAGT CAGGAACCGT GGTATTTTAC 9753
TCAGACTCCG AGTACTGAAC AGGCGGCTCT TACTCAGACG CAGATCGCAG AAACGGAGGC 9813
GTTGTTTACT CAGACTCCGA GTGCTGAACA GATGACTTTT ACTCAGACTC CGGGTGCAGA 9873
AACCGAGGCA CCTGCCCAGA CCCCGAGCAC GATACCCGAG ATATTTACTC AGTCTCGTAG 9933
CACGCCCCCC GAAACCGCTC GCGCTCCGAG CGCGGCGCCG GAGGTTTTTA CACAGAGTTC 9993
GAGTACGGTA ACGGAGGTGT TTACTCAGAC CCCGAGCACG GTACCGAAAA CTACTCTGAG 10053
TTCGAGTACT GAACCGGCGA TTTTTACTCG GACTCAGAGC GCGGGAACTG AGGCCTTTAC 10113
TCAGACTTCG AGTGCCGAGC CGGACACTAT GCGAACTCAG AGTACTGAAA CACACTTTTT 10173
CACTCAGGCC CCGAGTACGG TACCGAAAGC TACTCAGACT CCGAGTACAG AGCCGGAGGT 10233
GTTGACTCAG AGTCCGAGTA CCGAACCTGT GCCTTTCACC CGGACTCTGG GCGCAGAGCC 10293
GGAAATTACT CAGACCCCGA GCGCGGCACC GGAGGTTTAT ACTCGGAGTT CGAGTACGAT 10353
GCCAGAAACT GCACAGAGCA CACCCCTGGC CTCGCAAAAC CCTACCAGTT CGGGAACCGG 10413
GACGCATAAT ACTGAACCGA GGACTTATCC AGTGCAAACG ACACCACATA CCCAGAAACT 10473
CTACACAGAA AATAAGACTT TATCGTTTCC TACTGTTGTT TCAGAATTCC ATGAGATGTC 10533
GACGGCAGAG TCGCAGACGC CCCTATTGGA CGTCAAAATT GTAGAGGTGA AGTTTTCAAA 10593
GGATGGCGAA GTAACGGCGA CTTGCGTTTC CACCGTCAAA TCTCCCTATA GGGTAGAAAC 10653
TAATTGGAAA GTAGACCTCG TAGATGTAAT GGATGAAATT TCTGGGAACA GTCCCGCCGG 10713
GGTTTTTAAC AGTAATGAGA AATGGCAGAA ACAGCTGTAC TACAGAGTAA CCGATGGAAG 10773
AACATCGGTC CAGCTAATGT GCCTGTCGTG CACGAGCCAT TCTCCGGAAC CTTACTGTCT 10833
TTTCGACACG TCTCTTATAG CGAGGGAAAA AGATATCGCG CCAGAGTTAT ACTTTACCTC 10893
TGATCCGCAA ACGGCATACT GCACAATAAC TCTGCCGTCC GGCGTTGTTC CGAGATTCGA 10953
ATGGAGCCTT AATAATGTTT CACTGCCGGA ATATTTGACG GCCACGACCG TTGTTTCGCA 11013
TACCGCTGGC CAAAGTACAG TGTGGAAGAG CAGCGCGAGA GCAGGCGAGG CGTGGATTTC 11073
TGGCCGGGGA GGCAATATAT ACGAATGCAC CGTCCTCATC TCAGACGGCA CTCGCGTTAC 11133
TACGCGAAAG GAGAGGTGCT TAACAAACAC ATGGATTGCG GTGGAAAACG GTGCTGCTCA 11193
GGCGCAGCTG TATTCACTCT TTTCTGGACT TGTGTCAGGA TTATGCGGGA GCATATCTGC 11253
TTTGTACGCA ACGCTATGGA CCGCCATTTA TTTTTGA GGAATGCTTT TTGGACTATC 11310
GTACTGCTTT CTTCCTTCGC TAGCCAGAGC ACCGCCGCCG TCACGTACGA CTACATTTTA 11370
GGCCGTCGCG CGCTCGACGC GCTAACCATA CCGGCGGTTG GCCCGTATAA CAGATACCTC 11430
ACTAGGGTAT CAAGAGGCTG CGACGTTGTC GAGCTCAACC CGATTTCTAA CGTGGACGAC 11490
ATGATATCGG CGGCCAAAGA AAAAGAGAAG GGGGGCCCTT TCGAGGCCTC CGTCGTCTGG 11550
TTCTACGTGA TTAAGGGCGA CGACGGCGAG GACAAGTACT GTCCAATCTA TAGAAAAGAG 11610
TACAGGGAAT GTGGCGACGT ACAACTGCTA TCTGAATGCG CCGTTCAATC TGCACAGATG 11670
TGGGCAGTGG ACTATGTTCC TAGCACCCTT GTATCGCGAA ATGGCGCGGG ACTGACTATA 11730
TTCTCCCCCA CTGCTGCGCT CTCTGGCCAA TACTTGCTGA CCCTGAAAAT CGGGAGATTT 11790

GCGCAAACAG CTCTCGTAAC TCTAGAAGTT AACGATCGCT GTTTAAAGAT CGGGTCGCAG 11850

CTTAACTTTT TACCGTCGAA ATGCTGGACA ACAGAACAGT ATCAGACTGG ATTTCAAGGC 11910

GAACACCTTT ATCCGATCGC AGACACCAAT ACACGACACG CGGACGACGT ATATCGGGGA 11970

TACGAAGATA TTCTGCAGCG CTGGAATAAT TTGCTGAGGA AAAAGAATCC TAGCGCGCCA 12030

GACCCTCGTC CAGATAGCGT CCCGCAAGAA ATTCCCGCTG TAACCAAGAA AGCGGAAGGG 12090

CGCACCCCGG ACGCAGAAAG CAGCGAAAAG AAGGCCCCTC CAGAAGACTC GGAGGACGAC 12150

ATGCAGGCAG AGGCTTCTGG AGAAAATCCT GCCGCCCTCC CCGAAGACGA CGAAGTCCCC 12210

GAGGACACCG AGCACGATGA TCCAAACTCG GATCCTGACT ATTACAATGA CATGCCCGCC 12270

GTGATCCCGG TGGAGGAGAC TACTAAAAGT TCTAATGCCG TCTCCATGCC CATATTCGCG 12330

GCGTTCGTAG CCTGCGCGGT CGCGCTCGTG GGGCTACTGG TTTGGAGCAT CGTAAAATGC 12390

GCGCGTAGCT AA TCGAGCCTAG AATAGGTGGT TTCTTCCTAC ATGCCACGCC 12442

TCACGCTCAT AATATAAATC ACATGGAATA GCATACCAAT GCCTATTCAT TGGGACGTTC 12502

GAAAAGC ATG GCA TCG CTA CTT GGA ACT CTG GCT CTC CTT GCC GCG ACG 12551
Met Ala Ser Leu Leu Gly Thr Leu Ala Leu Leu Ala Ala Thr



CTC GCA CCC TTC GGC GCG ATG GGA ATC GTG ATC ACT GGA AAT CAC GTC 
Leu Ala Pro Phe Gly Ala Met Gly Ile Val Ile Thr Gly Asn His Val
15 20 25 30 

TCC GCC AGG ATT GAC GAC GAT CAC ATC GTG ATC GTC GCG CCT CGC CCC 
Ser Ala Arg Ile Asp Asp Asp His Ile Val Ile Val Ala Pro Arg Pro
35 40 45 

GAA GCT ACA ATT CAA CTG CAG CTA TTT TTC ATG CCT GGC CAG AGA CCC 
Glu Ala Thr Ile Gln Leu Gln Leu Phe Phe Met Pro Gly Gln Arg Pro
50 55 60 

CAC AAA CCC TAC TCA GGA ACC GTC CGC GTC GCG TTT CGG TCT GAT ATA 
His Lys Pro Tyr Ser Gly Thr Val Arg Val Ala Phe Arg Ser Asp Ile
65 70 75 

ACA AAC CAG TGC TAC CAG GAA CTT AGC GAG GAG CGC TTT GAA AAT TGC 
Thr Asn Gln Cys Tyr Gln Glu Leu Ser Glu Glu Arg Phe Glu Asn Cys



ACT CAT CGA TCG TCT TCT GTT TTT GTC GGC TGT AAA GTG ACC GAG TAC 
Thr His Arg Ser Ser Ser Val Phe Val Gly Cys Lys Val Thr Glu Tyr 
95 100 105 110

ACG TTC TCC GCC TCG AAC AGA CTA ACC GGA CCT CCA CAC CCG TTT AAG 
Thr Phe Ser Ala Ser Asn Arg Leu Thr Gly Pro Pro His Pro Phe Lys 
115 120 125

CTC ACT ATA CGA AAT CCT CGT CCG AAC GAC AGC GGG ATG TTC TAC GTA 
Leu Thr Ile Arg Asn Pro Arg Pro Asn Asp Ser Gly Met Phe Tyr Val
130 135 140 

ATT GTT CGG CTA GAC GAC ACC AAA GAA CCC ATT GAC GTC TTC GCG ATC 
Ile Val Arg Leu Asp Asp Thr Lys Glu Pro Ile Asp Val Phe Ala Ile
145 150 155



CAA CTA TCG GTG TAT CAA TTC GCG AAC ACC GCC GCG ACT CGC GGA CTC 
Gln Leu Ser Val Tyr Gln Phe Ala Asn Thr Ala Ala Thr Arg Gly Leu
160 165 170 

TAT TCC AAG GCT TCG TGT CGC ACC TTC GGA TTA CCT ACC GTC CAA CTT 
Tyr Ser Lys Ala Ser Cys Arg Thr Phe Gly Leu Pro Thr Val Gln Leu
175 180 185 190

GAG GCC TAT CTC AGG ACC GAG GAA AGT TGG CGC AAC TGG CAA GCG TAC 
Glu Ala Tyr Leu Arg Thr Glu Glu Ser Trp Arg Asn Trp Gln Ala Tyr
195 200 205 

GTT GCC ACG GAG GCC ACG ACG ACC AGC GCC GAG GCG ACA ACC CCG ACG 
Val Ala Thr Glu Ala Thr Thr Thr Ser Ala Glu Ala Thr Thr Pro Thr 
210 215 220 

CCC GTC ACT GCA ACC AGC GCC TCC GAA CTT GAA GCG GAA CAC TTT ACC 
Pro Val Thr Ala Thr Ser Ala Ser Glu Leu Glu Ala Glu His Phe Thr 
225 230 235 

TTT CCC TGG CTA GAA AAT GGC GTG GAT CAT TAC GAA CCG ACA CCC GCA 
Phe Pro Trp Leu Glu Asn Gly Val Asp His Tyr Glu Pro Thr Pro Ala 
240 245 250 

AAC GAA AAT TCA AAC GTT ACT GTC CGT CTC GGG ACA ATG AGC CCT ACG
Asn Glu Asn Ser Asn Val Thr Val Arg Leu Gly Thr Met Ser Pro Thr 
255 260 265 270 

CTA ATT GGG GTA ACC GTG GCT GCC GTC GTG AGC GCA ACG ATC GGC CTC 
Leu Ile Gly Val Thr Val Ala Ala Val Val Ser Ala Thr Ile Gly Leu
275 280 285 

GTC ATT GTA ATT TCC ATC GTC ACC AGA AAC ATG TGC ACC CCG CAC CGA 
Val Ile Val Ile Ser Ile Val Thr Arg Asn Met Cys Thr Pro His Arg
290 295 300 

AAA TTA GAC ACG GTC TCG CAA GAC GAC GAA GAA CGT TCC CAA ACT AGA 
Lys Leu Asp Thr Val Ser Gln Asp Asp Glu Glu Arg Ser Gln Thr Arg
305 310 315 

AGG GAA TCG CGA AAA TTT GGA CCC ATG GTT GCG TGC GAA ATA AAC AAG 
Arg Glu Ser Arg Lys Phe Gly Pro Met Val Ala Cys Glu Ile Asn Lys
320 325 330 

GGG GCT GAC CAG GAT AGT GAA CTT GTG GAA CTG GTT GCG ATT GTT AAC 
Gly Ala Asp Gln Asp Ser Glu Leu Val Glu Leu Val Ala Ile Val Asn
335 340 345 350 

CCG TCT GCG CTA AGC TCG CCC GAC TCA ATA AAA ATG TGA TTAAGTCTGA 
Pro Ser Ala Leu Ser Ser Pro Asp Ser Ile Lys Met
355 360 

ATGTGGCTCT CCAATCATTT CGATTCTCTA ATCTCCCAAT CCTCTCAAAA GGGGCAGTAT 

CGGACACGGA CTGGGAGGGG CGTACACGAT AGTTATATGG TACAGCAGAG GCCTCTGAAC 

ACTTAGGAGG AGAATTCAGC CGGGGAGAGC CCCTGTTGAG TAGGCTTGGG AGCATATTGC 

AGG ATG AAC ATG TTA GTG ATA GTT CTC GCC TCT TGT CTT GCG CGC CTA 
Met Asn Met Leu Val Ile Val Leu Ala Ser Cys Leu Ala Arg Leu
ACT TTT GCG ACG CGA CAC GTC CTC TTT TTG GAA GGC ACT CAG GCT GTC
Thr Phe Ala Thr Arg His Val Leu Phe Leu Glu Gly Thr Gln Ala Val



CTC GGG GAA GAT GAT CCC AGA AAC GTT CCG GAA GGG ACT GTA ATC AAA 
Leu Gly Glu Asp Asp Pro Arg Asn Val Pro Glu Gly Thr Val Ile Lys
35 40 45 

TGG ACA AAA GTC CTG CGG AAC GCG TGC AAG ATG AAG GCG GCC GAT GTC 
Trp Thr Lys Val Leu Arg Asn Ala Cys Lys Met Lys Ala Ala Asp Val 
50 55 60 

TGC TCT TCG CCT AAC TAT TGC TTT CAT GAT TTA ATT TAC GAC GGA GGA 
Cys Ser Ser Pro Asn Tyr Cys Phe His Asp Leu Ile Tyr Asp Gly Gly
65 70 75

AAG AAA GAC TGC CCG CCC GCG GGA CCC CTG TCT GCA AAC CTG GTA ATT 
Lys Lys Asp Cys Pro Pro Ala Gly Pro Leu Ser Ala Asn Leu Val Ile
80 85 90 95 

TTA CTA AAG CGC GGC GAA AGC TTC GTC GTG CTG GGT TCT GGG CTA CAC 
Leu Leu Lys Arg Gly Glu Ser Phe Val Val Leu Gly Ser Gly Leu His
100 105 110

AAC AGC AAT ATA ACT AAT ATC ATG TGG ACA GAG TAC GGA GGC CTG CTC 
Asn Ser Asn Ile Thr Asn Ile Met Trp Thr Glu Tyr Gly Gly Leu Leu
115 120 125

TTT GAT CCT GTA ACT CGT TCG GAC GAG GGA ATC TAT TTT CGA CGG ATC 
Phe Asp Pro Val Thr Arg Ser Asp Glu Gly Ile Tyr Phe Arg Arg Ile
130 135 140 

TCT CAG CCA GAT CTG GCC ATG GAA ACT ACA TCG TAC AAC GTC AGC GTT 
Ser Gln Pro Asp Leu Ala Met Glu Thr Thr Ser Tyr Asn Val Ser Val
145 150 155 

CTT TCG CAC GTA GAC GAG AAG GCT CCA GCA CCG CAC GAG GTG GAG ATA 
Leu Ser His Val Asp Glu Lys Ala Pro Ala Pro His Glu Val Glu Ile
160 165 170 175 

GAC ACC ATC AAG CCG TCA GAG GCC CAC GCG CAC GTG GAA TTA CAA ATG 
Asn Thr Ile Lys Pro Ser Glu Ala His Ala His Val Glu Leu Gln Met
180 185 190 

CTG CCG TTT CAT GAA CTC AAC GAC AAC AGC CCC ACC TAT GTG ACC CCT 
Leu Pro Phe His Glu Leu Asn Asp Asn Ser Pro Thr Tyr Val Thr Pro 
195 200 205 

GTT CTT AGA GTC TTC CCA CCG ACC GAG CAC GTA AAA TTT AAC GTT ACG 
Val Leu Arg Val Phe Pro Pro Thr Glu His Val Lys Phe Asn Val Thr 
210 215 220 

TAT TCG TGG TAT GGG TTT GAT GTC AAA GAG GAG TGC GAA GAA GTG AAA 
Tyr Ser Trp Tyr Gly Phe Asp Val Lys Glu Glu Cys Glu Glu Val Lys 
225 230 235 

CTG TTC GAG CCG TGC GTA TAC CAT CCT ACA GAC GGC AAA TGT CAG TTT 
Leu Phe Glu Pro Cys Val Tyr His Pro Thr Asp Gly Lys Cys Gln Phe
240 245 250 255 

CCC GCA ACC AAC CAG AGA TGC CTC ATA GGA TCT GTC TTG ATG GCG GAA 
Pro Ala Thr Asn Gln Arg Cys Leu Ile Gly Ser Val Leu Met Ala Glu
260 265 270 

TTC TTG GGC GCG GCC TCT TTG CTG GAT TGT TCC CGC GAT ACT CTA GAA 
Phe Leu Gly Ala Ala Ser Leu Leu Asp Cys Ser Arg Asp Thr Leu Glu 
275 280 285 

GAC TGC CAC GAA AAT CGC GTG CCG AAC CTA CGG TTC GAT TCG CGA CTC 
Asp Cys His Glu Asn Arg Val Pro Asn Leu Arg Phe Asp Ser Arg Leu 
290 295 300 






TCC GAG TCA CGC GCA GGC CTG GTG ATC ACT CCT CTT ATA GCC ATC CO. 
Ser Glu Ser Arg Ala Gly Leu Val Ile Ser Pro Leu Ile Ala Ile Pro
305 310 315 

rTT TTG ATT ATA GTC GTT TCC GAC GGA GAC ATT TTG GGA TGG AGC 
i£ Val Leu Ile Ile Val Val Ser Asp Gly Asp Ile Leu Gly Trp Ser
320 325 330 335 

TAP ACG GTG CTC GGG AAA CGT AAC AGT CCG CGC GTA GTA GTC GAA ACG 
Tyr Thr Val Leu Gly Lys Arg Asn Ser Pro Arg Val Val Val Glu Thr
340 345 350

CAC ATG CCC TCG AAG GTC CCG ATG AAC AAA GTA GTA ATT GGC AGT CCC 
His Met Pro Ser Lys Val Pro Met Asn Lys Val Val Ile Gly Ser Pro
355 360 365 

GGA CCA ATG GAC GAA ACG GGT AAC TAT AAA ATG TAC TTC GTC GTC GCG 
Gly Pro Met Asp Glu Thr Gly Asn Tyr Lys Met Tyr Phe Val Val Ala 
370 375 380 

GGG GTG GCC GCG ACG TGC GTA ATT CTT ACA TGC GCT CTG CTT CTG GGG 
Gly Val Ala Ala Thr Cys Val Ile Leu Thr Cys Ala Leu Leu Val Gly
385 390 395 

AAA AAG AAG TGC CCC GCG CAC CAA ATG GGT ACT TTT TCC AAG ACC GAA 
Lys Lys Lys Cys Pro Ala His Gln Met Gly Thr Phe Ser Lys Thr Glu
400 405 410 415 

CCA TTG TAC GCG CCG CTC CCC AAA AAC GAG TTT GAG GCC GGC GGG CTT 
Pro Leu Tyr Ala Pro Leu Pro Lys Asn Glu Phe Glu Ala Gly Gly Leu 
420 425 430 

ACG GAC GAT GAG GAA GTG ATT TAT GAC GAA GTA TAC GAA CCC CTA TTT 
Thr Asp Asp Glu Glu Val Ile Tyr Asp Glu Val Tyr Glu Pro Leu Phe
435 440 445 

CGC GGC TAC TGT AAG CAG GAA TTC CGC GAA GAT GTG AAT ACC TTT TTC 
Arg Gly Tyr Cys Lys Gln Glu Phe Arg Glu Asp Val Asn Thr Phe Phe
450 455 460 

GGT GCG GTC GTG GAG GGA GAA AGG GCC TTA AAC TTT AAA TCC GCC ATC 
Gly Ala Val Val Glu Gly Glu Arg Ala Leu Asn Phe Lys Ser Ala Ile
465 470 475 

GCA TCA ATG GCA GAT CGC ATC CTG GCA AAT AAA AGC GGC AGA AGG AAT 
Ala Ser Met Ala Asp Arg Ile Leu Ala Asn Lys Ser Gly Arg Arg Asn
480 485 490 495 

ATG GAT AGC TAT TAG TTGGTC ATG CCT TTT AAG ACC AGA GGG GCC GAA 
Met Asp Ser Tyr Met Pro Phe Lys Thr Arg Gly Ala Glu 

500 1 5 

GAC GCG GCC GCG GGC AAG AAC AGG TTT AAG AAA TCG AGA AAT CGG GAA 
Asp Ala Ala Ala Gly Lys Asn Arg Phe Lys Lys Ser Arg Asn Arg Glu 
10 15 20 25 

ATC TTA CCG ACC AGA CTG CGT GGC ACC GGT AAG AAA ACT GCC GGA TTG 
Ile Leu Pro Thr Arg Leu Arg Gly Thr Gly Lys Lys Thr Ala Gly Leu
30 35 40 

TCC AAT TAT ACC CAG CCT ATT CCC TGG AAC CCT AAA TTC TGC AGC GCG 
Ser Asn Tyr Thr Gln Pro Ile Pro Trp Asn Pro Lys Phe Cys Ser Ala
45 50 55 

CGC GGG GAA TCT GAC AAC CAC GCG TGT AAA GAC ACT TTT TAT CGC AGG 
Arg Gly Glu Ser Asp Asn His Ala Cys Lys Asp Thr Phe Tyr Arg Arg 
£ £ ^ys S - ^ "vS - - «» «p ser Pro

75 J 80 85 

rar ACA CCC ATG CCT ACT GAG TAT GGG CGC GTG CCC TCC GCA AAG CGC 
Ss Thr Pro Met Pro Thr Glu Tyr Gly Arg Val Pro Ser Ala Lys Arg

90 95 
AAA AAA CTA TCA TCT TCA GAC TGC GAG GGC GCG CAC CAA CCC CTA GTA 
Lys Lys Leu Ser Ser Ser Asp Cys Glu Gly Ala His Gln Pro Leu Val
110 115 

TCC TGT AAA CTT CCG GAT TCT CAA GCA GCA CCG GCG CGA ACC TAT AGT 
Ser Cys Lys Leu Pro Asp Ser Gln Ala Ala Pro Ala Arg Thr Tyr Ser



125 
140 145

GGC GTC GAC GCT GTT GCG GAC TTA GAA ACQ CGC GCG GAA CTT CCT GGC
Gly Val Asp Ala Val Ala Asp Leu Glu Thr Arg Ala Glu Leu Pro Gly
GCT ACG ACG GAA CAA ACG GAA AGT AAA AAT AAG CTC CCC AAC CAA CAA
Ala Thr Thr Glu Gln Thr Glu Ser Lys Asn Lys Leu Pro Asn Gln Gln
170 175 180 

Trr CGC CTG AAG CCG AAA CCC ACA AAC GAG CAC GTC GGA GGG GAG CGG 
Ser Arg Leu Lys Pro Lys Pro Thr Asn Glu His Val Gly Gly Glu Arg

Trr CCC TCC GAA GGC ACG GTC GAG GCG CCA TCG CTC GGC ATC CTC TCG 
lys Pro Ser Glu Gly Thr Val Glu Ala Pro Ser Leu Gly Ile Leu Ser
205 210 2 

CGC GTC GGG GCA GCG ATA GCA AAC GAG CTG GCT CGT ATG CGG AGG GCG 
Arg Val Gly Ala Ala Ile Ala Asn Glu Leu Ala Arg Met Arg Arg Ala
220 225 230 

TGT CTT CCG CTC GCC GCG TCG GCG GCC GCT GCC GGA ATA GTG GCC TGG 
Cys Leu Pro Leu Ala Ala Ser Ala Ala Ala Ala Gly Ile Val Ala Trp

235 240 
GCC GCG GCG AGG GCC TTG CAG AAA CAA GGG CGG TAG CAGTAATAAT 
Ala Ala Ala Arg Ala Leu Gln Lys Gln Gly Arg .
250 255 260 

AACCACACAA ATATTGACAA TAATAAACGC GTACGCGG ATG AGT AAG TGT TAT 

Met Ser Lys Cys Tyr 
1 5 

TGT CTC GCG CGC CAT CTT TAT AAA AGC CCG CGT TGC GTG GGC CGG CGG 
Cys Leu Ala Arg His Leu Tyr Lys Ser Pro Arg Cys Val Gly Arg Arg 
10 15 

GTA GCA TTT GGA GGG TTG GCG ACC ATG TCG AGA CCT CCG ACG TCA CAT 
Val Ala Phe Gly Gly Leu Ala Thr Met Ser Arg Pro Pro Thr Ser His
25 30 35 

TTG GAC TTA GCT TTC TCG GCG GCC TTT AGG GGC ACG GAC CTG CCC GGA 
Leu Asp Leu Ala Phe Ser Ala Ala Phe Arg Gly Thr Asp Leu Pro Gly
40 45 50

GGG AGA TTC TGG CGG GCG TCG CAG AGT TGC GAT ATT TTC TTT TGG CCC 
Gly Arg Phe Trp Arg Ala Ser Gln Ser Cys Asp Ile Phe Phe Trp Pro
200 205 210

rrr arc CTC CTG GTG ATA GCG CTG CCT CTG TCA AAC TTC ACC CTC CCG 
Sg itr Leu Leu Val Ile Ala Leu Pro Leu Ser Asn Phe Thr Leu Pro

215 220 225 

GAA GAC CTC CCC GAA GGT TCC CAA AAC TCG ATC CGC GCG TTC GTG GCC
Glu Asp Leu Pro Glu Gly Ser Gln Asn Ser Ile Arg Ala Phe Val Ala
230 235 240 

CAC CTC ATG AAC TGT GTA GCT ACC GAT AAG ATC ATG TCT CCG GAC GTG 
His Leu Met Asn Cys Val Ala Thr Asp Lys Ile Met Ser Pro Asp Val
250 255 

CGC GTC CCA GTC GAA GAA AGC TTT TAC AGC CAC TGT TTA AGG GAA ATC 
Arg Val Pro Val Glu Glu Ser Phe Tyr Ser His Cys Leu Arg Glu Ile
265 270 275 

ATT ATG TGC GAG AGA GCT TTT TGT TAC CCG TGC AAT CCC CCG CCA WA 
Ile Met Cys Glu Arg Ala Phe Cys Tyr Pro Cys Asn Pro Pro Pro Lys

280 285 290 

TGG TGA GCTCAGGGTC CCATTTACCC CCGCAACACC CTCTCGCCGC CAGGGCGCGC 
Trp . 
295 

GCTCTATCTT TCTCTATGTC CCGTCGCCAC CGGCCTAACC GAACGGTGGA ACGGGGCCGC 17123
CCGGGAAAGC CTAGCTCCGC ACAGACACAG ACAGACAAAC GGCCTCTCGA TTGCAACAAC 17183
GTGAAAAACA CACAATAATA TTACTTTATT TATTTCGCAG CGCTCGCGTG TCGCTCGTTT 17243
CTGGGGGGGG GGGGGGGTTG TGTTTCGCTG CCGCGGAGTG GGTGAGGGGG GAGAGTGGAC 17303
GGAGGACAGT GTAAAAACCC GCGAGGTTGT CAGGGACGCG GAGGTAAGGA AGCCGCTAGA 17363
GGGCGTCATT CTCCCGTCAA CGGCGCCGCG CCAGCGCCAG CGCGATTCCA GCGTCCCTCG 17423
CAAAGATCGA GGCGCTCACT TCGGGAGAGT TGGCGGCAGC CGCGCCGGCG CGCAAGGCGG 17483
CCAGGTATTC GCGACGGCGC ATCTCCGATT GCGGGACTCT CCCCGTCCCG ACGCAGAAAA 17543
GACTCTGCAC GTAAATCGAG TCGTACAAAC ACGCTTCGCG GCACGGCCGA GGGTGCCACA 17603
AGCAGCACCA GTGCCCGGCC GCCAGGAGAG ACCGGGAGGT CTCTCGCTCG GCGGACGCGA 17663
GGTCGGCCGG CAGGTGTCGG GACAGCCAGG ATAACAGTTC TTTCTCGATC ACGAAACGCA 17723
TATTCTCGCT CTGCATGGTA CGGCTCTCGT CCTCGAAGCG GTCACCGTCC AGAGAGTCGA 17783
GCCAGAAATC GCCCTCGATC CCTTCCATTG CCTCCTCTAA TTCTCCCAAC CCGTTGCCGT 17843
TATTCGGGTC AGTGGGGGAG TTGGCGACGC TCTCCGTCGA CGCCCACCGC GGCCTCAAAC 17903
CTAGGCCGCG CGTCGCCCGG CTGGCCCCCA GAAAAGCCGA ACGTAGCCGT TCGTTGGCAG 17963
CAATCAGCGC GTTGCGGCGG TCTCTCGCTG ACGTGAACTC TCCGTAACCG TCCTTGAGTT 18023
CGGACGATAT CTCGACTAAC TCGCGCCAGA GCCGCGAGGC GATCCGAATG CACGGGTCTA 18083
CCGAGTCGGC CGACTCCGAC GAGGCGACAG CGGCACCTTG CCGCCTGCGC TCGCCACGAG 18143
GTACCGCGTG CGCAGTAATG TGATCGCTGT CCATCAGCGC CAGCGGGTAG CACCCGTCGA 18203
GCATGTTTTC CATTTAGGGA TGTGTCTAGA GGAGAGCGGT CCGGGCGGTC TACTGCGGTT 18263
GTGCTGGGCC GAGTTGGACG GCTACGGATT GCGCCGGGCG TGAAGGGGGG GGGGCGGCGG 18323
GATGTCCGGC GGTCGCAAAG GGGCGCGCGT CCGCTTCTGC GTGAAGGCTG AGCGGGAAAG 18383
AAGTTCTGGA TGAGAATGGA TCGAGCGGGC AATAAATGTC CAGAGTAGGG GGGTGGGAGG 18443
GAGGGGGAGG TTCTGCCCCG CGTCTCCTCT ATCTGCTCGT CGAGGCCTCG GCCTTGCGTC 18503
GCCGTGCAGG GGTCGAGGCC GCTTCTTCTT TTTTACTTCT CTCCTCGGAT TCCTCGTCAG 18563
AGGAAGAAGA AAATGACAAC CTCCGTCTTT TAAGAGTGCG CCTACCCGCC CTGGCGGCCG 18623
AAGCCTTCCG TGGGTCTTTG CGGGTGCCGC GCACCGCAAT AACGCACGGA CGCGGGGGAT 18683
AGCAAATGGC GGCGGCGCCG GAGAGCTGTC GTCAATAAAG TCTAAGTCAG ATTGCGTGGG 18743
CTCTGACTCG GTGGAGCTGT GTCCCGTGTC CTCCTCGCCC AAGTCCACTC CCCGGCACCC 18803
AGGCTGCTCT TCCTCCGACT CCGGGTCGCT CCAGCTCCTC CCGCGTGCCG GTTCTTCGTC 18863
CTCCGATACG TCCGAAAAGA AAAACTTCTG GGAGAGCTCT TCGGGATCC 18912



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:

Met Glu Asn Met Leu Asp Gly Cys Tyr Pro Leu Ala Leu Met Asp Ser
1 5 10 15

Asp His Ile Thr Ala His Ala Val Pro Arg Gly Glu Arg Arg Arg Gln
20 25 30

Gly Ala Ala Val Ala Ser Ser Glu Ser Ala Asp Ser Val Asp Pro Cys
35 40 45

Ile Arg Ile Ala Ser Arg Leu Trp Arg Glu Leu Val Glu Ile Ser Ser
50 55 60

Glu Leu Lys Asp Gly Tyr Gly Glu Phe Thr Ser Ala Arg Asp Arg Arg
65 70 75 80

Asn Ala Leu Ile Ala Ala Asn Glu Arg Leu Arg Ser Ala Phe Leu Gly
85 90 95

Ala Ser Arg Ala Thr Arg Gly Leu Gly Leu Arg Pro Arg Trp Ala Ser
100 105 110

Thr Glu Ser Val Ala Asn Ser Pro Thr Asp Pro Asn Asn Gly Asn Gly
115 120 125

Leu Gly Glu Leu Glu Glu Ala Met Glu Gly Ile Glu Gly Asp Phe Trp
130 135 140

Leu Asp Ser Leu Asp Gly Asp Arg Phe Glu Asp Glu Ser Arg Thr Met
145 150 155 160

Gln Ser Glu Asn Met Arg Phe Val Ile Glu Lys Glu Leu Leu Ser Trp
165 170 175

Leu Ser Arg His Leu Pro Ala Asp Leu Ala Ser Ala Glu Arg Glu Thr
180 185 190

Ser Arg Ser Leu Leu Ala Ala Gly His Trp Cys Cys Leu Trp His Pro
195 200 205

Arg Pro Cys Arg Glu Ala Cys Leu Tyr Asp Ser Ile Tyr Val Gln Ser
210 215 220

Leu Phe Cys Val Gly Thr Gly Arg Val Pro Gln Ser Glu Met Arg Arg
225 230 235 240

Arg Glu Tyr Leu Ala Ala Leu Arg Ala Gly Ala Ala Ala Ala Asn Ser
245 250 255

Pro Glu Val Ser Ala Ser Ile Phe Ala Arg Asp Ala Gly Ile Ala Leu
260 265 270

Ala Leu Ala Arg Arg Arg .
275



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Met Ser Lys Cys Tyr Cys Leu Ala Arg His Leu Tyr Lys Ser Pro Arg
1 5 10 15

Cys Val Gly Arg Arg Val Ala Phe Gly Gly Leu Ala Thr Met Ser Arg
20 25 30

Pro Pro Thr Ser His Leu Asp Leu Ala Phe Ser Ala Ala Phe Arg Gly
35 40 45

Thr Asp Leu Pro Gly Gly Arg Phe Trp Arg Ala Ser Gln Ser Cys Asp
50 55 60

Ile Phe Phe Trp Pro Asp Leu Ala Ala Val Ile Val Gln Ala Ala Arg
65 70 75 80

Ala Tyr Phe Glu Gly Lys Glu Arg Leu Gly Ser Leu Gln Val Ala Glu
85 90 95

Asp Ile Thr Ala His Asp Pro Arg Ile Ala Pro Ala Ala Lys Arg Ala
100 105 110

Val Ala Ala Ala Val Gly Leu Trp Thr Ala Leu Ser Glu Leu Val Gly
115 120 125

Gly Pro Asn Gly Glu Leu Glu Ser Lys Val Trp Gly Lys Gln Ile Pro
130 135 140

Arg Ala Ala Ala Trp Glu Ile Arg Asp Val Pro Lys Val Pro Val Ile
145 150 155 160

Gly Pro Asp Ile Leu Ser Phe Phe Ser Ala Ala Val Glu Leu Pro Val
165 170 175

Leu Tyr Ile Arg Ala Arg Gly Gly Ala His Ser Arg Ser Ala His Trp
180 185 190

Asn Asn Gln Ser Ser Ala Pro Ala Ala Gly Leu Ala Ala Ile Arg Ile
195 200 205

Gly Met Glu Met Val Arg Ser Leu Leu Val Ile Ala Leu Pro Leu Ser
210 215 220

Asn Phe Thr Leu Pro Glu Asp Leu Pro Glu Gly Ser Gln Asn Ser Ile
225 230 235 240

Arg Ala Phe Val Ala His Leu Met Asn Cys Val Ala Thr Asp Lys Ile
245 250 255

Met Ser Pro Asp Val Arg Val Pro Val Glu Glu Ser Phe Tyr Ser His
260 265 270

Cys Leu Arg Glu Ile Ile Met Cys Glu Arg Ala Phe Cys Tyr Pro Cys
275 280 285

Asn Pro Pro Pro Lys Trp .
290



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:

Met Ala Pro Val Lys Val Thr Ile Val Ser Ala Val Asp Ser His Tyr
1 5 10 15

Lys Leu Pro Asn Ser Arg Phe Glu Leu Ser Asp Ser Gly Trp Lys Glu
20 25 30

Leu Val His Ala Val Lys Thr Met Ala Ser Tyr Asp Arg Pro Ser Thr
35 40 45

Leu Ser Val Ile Val Arg Pro Ala Ser Leu Tyr Glu Val Ser Gly Glu
50 55 60

Leu Phe Ser Leu Pro Arg Met Cys Arg Pro Val Ile Arg Phe Gly Glu
65 70 75 80

Gly Gly Asp Pro Pro Gly Val Ser Pro Glu Trp Ser Gly Leu Asp Ala
85 90 95

Gly Phe Tyr His Leu Ser Ser Gly Ala Tyr Ala Ala Lys Glu Phe His
100 105 110

Leu Trp Val Leu Gly Thr Ala Asp Ile Cys Met Ala Ala Leu Asn Leu
115 120 125

Pro Ala Pro Lys Thr Phe Leu Ile Thr Glu Thr Gly Gly Lys Asn Phe
130 135 140

Glu Arg Gly Val Glu Ile Phe Leu Val Asn Gly Asp Lys Thr Thr Leu
145 150 155 160

Ser Leu Ser His Pro Ser Val Trp Thr Thr Leu Ala Pro Ser Ser Leu
165 170 175

Arg Thr Pro Trp Pro Tyr Ser Thr Val Lys Phe Leu Lys Val Lys Pro
180 185 190

Asn Ser Ala Ala Tyr Cys Val Ser Asp Ser Asp Asp Gly Glu Arg Gln
195 200 205

Pro Lys Phe Phe Leu Gly Ser Leu Phe Lys Ser Lys Lys Pro Arg Ser
210 215 220

Pro Arg Arg Arg Arg
225

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:

Met Arg Phe Arg Arg Ile Cys Ser Arg Ser Arg Ala Glu Lys Arg Arg
1 5 10 15

Arg Thr Thr Glu Asn Pro Leu Thr Ser Lys Arg Val Cys Val Leu Asp
20 25 30

Ser Phe Ser Arg Thr Met Ser Leu Arg Pro Tyr Ala Glu Ile Leu Pro
35 40 45

Thr Ala Glu Gly Val Glu Arg Leu Ala Glu Leu Val Ser Val Thr Met
50 55 60

Thr Glu Arg Ala Glu Pro Val Thr Glu Asn Thr Ala Val Asn Ser Ile
65 70 75 80

Pro Pro Ala Asn Glu Asn Gly Gln Asn Phe Ala Tyr Ala Gly Asp Gly
85 90 95

Pro Ser Thr Thr Glu Lys Val Asp Gly Ser His Thr Asp Phe Asp Glu
100 105 110

Ala Ser Ser Asp Tyr Ala Gly Pro Val Pro Leu Ala Gln Thr Arg Leu
115 120 125

Lys His Ser Asp Glu Phe Leu Gln His Phe Arg Val Leu Asp Asp Leu
130 135 140

Val Glu Gly Ala Tyr Gly Phe Ile Cys Asp Val Arg Arg Tyr Thr Glu
145 150 155 160

Glu Glu Gln Arg Arg Arg Gly Val Asn Ser Thr Asn Gln Gly Lys Ser
165 170 175

Lys Cys Lys Arg Leu Ile Ala Lys Tyr Val Lys Asn Gly Thr Arg Ala
180 185 190

Ala Ser Gln Leu Glu Asn Glu Ile Leu Val Leu Gly Arg Leu Asn His
195 200 205

Glu Asn Val Leu Lys Ile Gln Glu Ile Leu Arg Tyr Pro Asp Asn Thr
210 215 220

Tyr Met Leu Thr Gln Arg Tyr Gln Phe Asp Leu Tyr Ser Tyr Met Tyr
225 230 235 240

Asp Glu Ala Phe Asp Trp Lys Asp Ser Pro Met Leu Lys Gln Thr Arg
245 250 255

Arg Ile Met Lys Gln Leu Met Ser Ala Val Ser Tyr Ile His Ser Lys
260 265 270

Lys Leu Ile His Arg Asp Ile Lys Leu Glu Asn Ile Phe Leu Asn Cys
275 280 285

Asp Gly Lys Thr Val Leu Gly Asp Phe Gly Thr Val Thr Pro Phe Glu
290 295 300

Asn Glu Arg Glu Pro Phe Glu Tyr Gly Trp Val Gly Thr Val Ala Thr
305 310 315 320

Asn Ser Pro Glu Ile Leu Ala Arg Asp Ser Tyr Cys Glu Ile Thr Asp
325 330 335

Ile Trp Ser Cys Gly Val Val Leu Leu Glu Met Val Ser His Glu Phe
340 345 350

Cys Pro Ile Gly Asp Gly Gly Gly Asn Pro His Gln Gln Leu Leu Lys
355 360 365

Val Ile Asp Ser Leu Ser Val Cys Asp Glu Glu Phe Pro Asp Pro Pro
370 375 380

Cys Asn Leu Tyr Asn Tyr Leu His Tyr Ala Ser Ile Asp Arg Ala Gly
385 390 395 400

His Thr Val Pro Ser Leu Ile Arg Asn Leu His Leu Pro Ala Asp Val
405 410 415

Glu Tyr Pro Leu Val Lys Met Leu Thr Phe Asp Trp Arg Leu Arg Pro
420 425 430

Ala A ia Glu Val Leu Ala Met Pro Leu Phe Ser Ala Glu Glu Glu 
ser axb *^ 44Q 445 

■ Ile His Gly Lys His Lys Pro Ile Arg Pro Glu
455 460

Ile Arg Ala Arg Val Pro Arg Ser Met Ser Glu Gly
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Met Thr Leu Pro His Arg Leu Thr Lys Arg Pro Phe Ala Arg Arg Phe 
1 5 10 15 

Cys Ser Val Phe Val Ile His Tyr Ser Glu Thr Lys Leu Asp Arg Tyr
20 25 30

Asn Lys Thr Met Leu Leu Tyr Arg Pro Asp Ser Thr Met Arg His Ser
35 40 45

Gly Gly Asp Ala Asn His Arg Gly Ile Arg Pro Arg Arg Lys Ser Ile
50 55 60

Gly Ala Phe Ser Ala Arg Glu Lys Thr Gly Lys Arg Asn Ala Leu Thr
65 70 75 80

Glu Ser Ser Ser Ser Ser Asp Met Leu Asp Pro Phe Ser Thr Asp Lys
85 90 95

Glu Phe Gly Gly Lys Trp Thr Val Asp Gly Pro Ala Asp Ile Thr Ala
100 105 110

Glu Val Leu Ser Gln Ala Trp Asp Val Leu Gln Leu Val Lys His Glu
115 120 125

Asp Ala Glu Glu Glu Arg Val Thr Tyr Glu Ser Lys Pro Thr Pro Ile
130 135 140

Gln Pro Phe Asn Ala Trp Pro Asp Gly Pro Ser Trp Asn Ala Gln Asp
145 150 155 160

Phe Thr Arg Ala Pro Ile Val Tyr Pro Ser Ala Glu Val Leu Asp Ala
165 170 175

Glu Ala Leu Lys Val Gly Ala Phe Val Ser Arg Val Leu Gln Cys Val
180 185 190

Pro Phe Thr Arg Ser Lys Lys Ser Val Thr Val Arg Asp Ala Gln Ser
195 200 205

Phe Leu Gly Asp Ser Phe Trp Arg Ile Met Gln Asn Val Tyr Thr Val
210 215 220

Val Leu Arg Gln His Ile Thr Arg Leu Arg His Pro Ser Ser Lys Ser
225 230 235 240

Ile Val Asn Cys Asn Asp Pro Leu Trp Tyr Ala Tyr Ala Asn Gln Phe
245 250 255

His Trp Arg Gly Met Arg Val Pro Ser Leu Lys Leu Ala Ser Pro Pro
260 265 270

Glu Glu Asn Ile Gln His Gly Pro Met Ala Ala Val Phe Arg Asn Ala
275 280 285

Gly Ala Gly Leu Phe Leu Trp Pro Ala Met Arg Ala Ala Phe Glu Glu
290 295 300

Arg Asp Lys Arg Leu Leu Arg Ala Cys Leu Ser Ser Leu Asp Ile Met
305 310 315 320

Asp Ala Ala Val Leu Ala Ser Phe Pro Phe Tyr Trp Arg Gly Val Gln
325 330 335

Asp Thr Ser Arg Phe Glu Pro Ala Leu Gly Cys Leu Ser Glu Tyr Phe
340 345 350

Ala Leu Val Val Leu Leu Ala Glu Thr Val Leu Ala Thr Met Phe Asp
355 360 365

His Ala Leu Val Phe Met Arg Ala Leu Ala Asp Gly Asn Phe Asp Asp
370 375 380

Tyr Asp Glu Thr Arg Tyr Ile Asp Pro Val Lys Asn Glu Tyr Leu Asn
385 390 395 400

Gly Ala Glu Gly Thr Leu Leu Arg Gly Ile Val Ala Ser Asn Thr Ala
405 410 415

Leu Ala Val Val Cys Ala Asn Thr Tyr Ser Thr Ile Arg Lys Leu Pro
420 425 430

Ser Val Ala Thr Ser Ala Cys Asn Val Ala Tyr Arg Thr Glu Thr Leu
435 440 445

Lys Ala Arg Arg Pro Gly Met Ser Asp Ile Tyr Arg Ile Leu Gln Lys
450 455 460

Glu Phe Phe Phe Tyr Ile Ala Trp Leu Gln Arg Val Ala Thr His Ala
465 470 475 480

Asn Phe Cys Leu Asn Ile Leu Lys Arg Ser Val Asp Thr Gly Ala Pro
485 490 495

Pro Phe Leu Phe Arg Ala Ser Ser Glu Lys Arg Leu Gln Gln Leu Asn
500 505 510

Lys Met Leu Cys Pro Leu Leu Val Pro Ile Gln Tyr Glu Asp Phe Ser
515 520 525

Lys Ala Met Gly Ser Glu Leu Lys Arg Glu Lys Leu Glu Thr Phe Val
530 535 540

Lys Ala Ile Ser Ser Asp Arg Asp Pro Arg Gly Ser Leu Arg Phe Leu
545 550 555 560

Ile Ser Asp His Ala Arg Glu Ile Ile Ala Asp Gly Val Arg Phe Lys
565 570 575

Pro Val Ile Asp Glu Pro Val Arg Ala Ser Val Ala Leu Ser Thr Ala
580 585 590
Ala Ala Gly Lys Val I*. Ala «g Arg « Thr Ser Val Arg Ala Pro
595 600 605
Val Pro Gly Ala Gly Ala Val Ser Ala Arg Arg Lys Ser Glu Ile .
610 615 620



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:

Met Ser Gly Phe Ser Asn Ile Gly Ser Ile Ala Thr Val Ser Leu Val
1 5 10 15

Cys Ser Leu Leu Cys Ala Ser Val Leu Gly Ala Pro Val Leu Asp Gly
20 25 30

Leu Glu Ser Ser Pro Phe Pro Phe Gly Gly Lys Ile Ile Ala Gln Ala
35 40 45

Cys Asn Arg Thr Thr Ile Glu Val Thr Val Pro Trp Ser Asp Tyr Ser
50 55 60

Gly Arg Thr Glu Gly Val Ser Val Glu Val Lys Trp Phe Tyr Gly Asn
65 70 75 80

Ser Asn Pro Glu Ser Phe Val Phe Gly Val Asp Ser Glu Thr Gly Ser
85 90 95

Gly His Glu Asp Leu Ser Thr Cys Trp Ala Leu Ile His Asn Leu Asn
100 105 110

Ala Ser Val Cys Arg Ala Ser Asp Ala Gly Ile Pro Asp Phe Asp Lys
115 120 125

Gln Cys Glu Lys Val Gln Arg Arg Leu Arg Ser Gly Val Glu Leu Gly
130 135 140

Ser Tyr Val Ser Gly Asn Gly Ser Leu Val Leu Tyr Pro Gly Met Tyr
145 150 155 160

Asp Ala Gly Ile Tyr Ala Tyr Gln Leu Ser Val Gly Gly Lys Gly Tyr
165 170 175

Thr Gly Ser Val Tyr Leu Asp Val Gly Pro Asn Pro Gly Cys His Asp
180 185 190

Gln Tyr Gly Tyr Thr Tyr Tyr Ser Leu Ala Asp Glu Ala Ser Asp Leu
195 200 205

Ser Ser Tyr Asp Val Ala Ser Pro Glu Leu Asp Gly Pro Met Glu Glu
210 215 220

Asp Tyr Ser Asn Cys Leu Asp Met Pro Pro Leu Arg Pro Trp Thr Thr
225 230 235 240

Val Cys Ser His Asp Val Glu Glu Gln Glu Asn Ala Thr Asp Glu Leu
245 250 255

Tyr Leu Trp Asp Glu Glu Cys Ala Gly Pro Leu Asp Glu Tyr Val Asp
260 265 270

Glu Arg Ser Glu Thr Met Pro Arg Met Val Val Phe Ser Pro Pro Ser
275 280 285

Thr Leu Gln Gln
290



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Met Gly Thr Met Leu Val Leu Arg Leu Phe Leu Leu Ala Val Ala Asp
1 5 10 15

Ala Ala Leu Pro Thr Gly Arg Phe Cys Arg Val Trp Lys Val Pro Pro
20 25 30

Gly Gly Thr Ile Gln Glu Asn Leu Ala Val Leu Ala Glu Ser Pro Val
35 40 45

Thr Gly His Ala Thr Tyr Pro Pro Pro Glu Gly Ala Val Ser Phe Gln
50 55 60

Ile Phe Ala Asp Thr Pro Thr Leu Arg Ile Arg Tyr Gly Ala Thr Glu
65 70 75 80

Asp Glu Leu Ala Leu Glu Arg Gly Thr Ser Ala Ser Asp Ala Asp Asn
85 90 95

Val Thr Phe Ser Leu Ser Tyr Arg Pro Arg Pro Glu Ile His Gly Ala
100 105 110

Tyr Phe Thr Ile Gly Val Phe Ala Thr Gly Gln Ser Thr Glu Ser Ser
115 120 125

Tyr Ser Val Ile Ser Arg Val Leu Val Asn Ala Ser Leu Glu Arg Ser
130 135 140

Val Arg Leu Glu Thr Pro Cys Asp Glu Asn Phe Leu Gln Asn Glu Pro
145 150 155 160

Thr Trp Gly Ser Lys Arg Trp Leu Gly Pro Pro Ser Pro Tyr Val Arg
165 170 175

Asp Asn Asp Val Ala Val Leu Thr Lys Ala Gln Tyr Ile Gly Glu Cys
180 185 190

Tyr Ser Asn Ser Ala Ala Gln Thr Gly Leu Thr Ser Leu Asn Met Thr
195 200 205

Phe Phe Tyr Ser Pro Lys Arg Ile Val Asn Val Thr Trp Thr Thr Gly
210 215 220

Gly Pro Ser Pro Ser Arg Ile Thr Val Tyr Ser Ser Arg Glu Asn Gly
225 230 235 240

Gln Pro Val Leu Arg Asn Val Ser Asp Gly Phe Leu Val Lys Tyr Thr
245 250 255

Pro Asp Ile Asp Gly Arg Ala Met Ile Asn Val Ile Ala Asn Tyr Ser
260 265 270

Pro Ala Asp Ser Gly Ser Val Leu Ala Phe Thr Ala Phe Arg Glu Gly
275 280 285

Lys Leu Pro Ser Ala Ile Gln Leu His Arg Ile Asp Met Ser Gly Thr
290 295 300

Glu Pro Pro Gly Thr Glu Thr Thr Phe Asp Cys Gln Lys Met Ile Glu
305 310 315 320

Thr Pro Tyr Arg Ala Leu Gly Ser Asn Val Pro Arg Asp Asp Ser Ile
325 330 335

Arg Pro Gly Ala Thr Leu Pro Pro Phe Asp Thr Ala Ala Pro Asp Phe
340 345 350

Asp Thr Gly Thr Ser Pro Thr Pro Thr Thr Val Pro Glu Pro Ala Ile
355 360 365

Thr Thr Leu Ile Pro Arg Ser Thr Ser Asp Met Gly Phe Phe Ser Thr
370 375 380

Ala Arg Ala Thr Gly Ser Glu Thr Leu Ser Val Pro Val Gln Glu Thr
385 390 395 400

Asp Arg Thr Leu Ser Thr Thr Pro Leu Thr Leu Pro Leu Thr Pro Gly
405 410 415

Glu Ser Glu Asn Thr Leu Phe Pro Thr Thr Ala Pro Gly Ile Ser Thr
420 425 430

Glu Thr Pro Ser Ala Ala His Glu Thr Thr Gln Thr Gln Ser Ala Glu
435 440 445

Thr Val Val Phe Thr Gln Ser Pro Ser Thr Glu Ser Glu Thr Ala Arg
450 455 460

Ser Gln Ser Gln Glu Pro Trp Tyr Phe Thr Gln Thr Pro Ser Thr Glu
465 470 475 480

Gln Ala Ala Leu Thr Gln Thr Gln Ile Ala Glu Thr Glu Ala Leu Phe
485 490 495

Thr Gln Thr Pro Ser Ala Glu Gln Met Thr Phe Thr Gln Thr Pro Gly
500 505 510

Ala Glu Thr Glu Ala Pro Ala Gln Thr Pro Ser Thr Ile Pro Glu Ile
515 520 525

Phe Thr Gln Ser Arg Ser Thr Pro Pro Glu Thr Ala Arg Ala Pro Ser
530 535 540

Ala Ala Pro Glu Val Phe Thr Gln Ser Ser Ser Thr Val Thr Glu Val
545 550 555 560

Phe Thr Gln Thr Pro Ser Thr Val Pro Lys Thr Thr Leu Ser Ser Ser
565 570 575

Thr Glu Pro Ala Ile Phe Thr Arg Thr Gln Ser Ala Gly Thr Glu Ala
580 585 590

Phe Thr Gln Thr Ser Ser Ala Glu Pro Asp Thr Met Arg Thr Gln Ser
595 600 605

Thr Glu Thr His Phe Phe Thr Gln Ala Pro Ser Thr Val Pro Lys Ala
610 615 620

Thr Gln Thr Pro Ser Thr Glu Pro Glu Val Leu Thr Gln Ser Pro Ser
625 630 635 640

Thr Glu Pro Val Pro Phe Thr Arg Thr Leu Gly Ala Glu Pro Glu Ile
645 650 655

Thr Gln Thr Pro Ser Ala Ala Pro Glu Val Tyr Thr Arg Ser Ser Ser
660 665 670

Thr Met Pro Glu Thr Ala Gln Ser Thr Pro Leu Ala Ser Gln Asn Pro
675 680 685

Thr Ser Ser Gly Thr Gly Thr His Asn Thr Glu Pro Arg Thr Tyr Pro
690 695 700

Val Gln Thr Thr Pro His Thr Gln Lys Leu Tyr Thr Glu Asn Lys Thr
705 710 715 720

Leu Ser Phe Pro Thr Val Val Ser Glu Phe His Glu Met Ser Thr Ala
725 730 735

Glu Ser Gln Thr Pro Leu Leu Asp Val Lys Ile Val Glu Val Lys Phe
740 745 750

Ser Asn Asp Gly Glu Val Thr Ala Thr Cys Val Ser Thr Val Lys Ser
755 760 765

Pro Tyr Arg Val Glu Thr Asn Trp Lys Val Asp Leu Val Asp Val Met
770 775 780

Asp Glu Ile Ser Gly Asn Ser Pro Ala Gly Val Phe Asn Ser Asn Glu
785 790 795 800

Lys Trp Gln Lys Gln Leu Tyr Tyr Arg Val Thr Asp Gly Arg Thr Ser
805 810 815

Val Gln Leu Met Cys Leu Ser Cys Thr Ser His Ser Pro Glu Pro Tyr
820 825 830

Cys Leu Phe Asp Thr Ser Leu Ile Ala Arg Glu Lys Asp Ile Ala Pro
835 840 845

Glu Leu Tyr Phe Thr Ser Asp Pro Gln Thr Ala Tyr Cys Thr Ile Thr
850 855 860

Leu Pro Ser Gly Val Val Pro Arg Phe Glu Trp Ser Leu Asn Asn Val
865 870 875 880

Ser Leu Pro Glu Tyr Leu Thr Ala Thr Thr Val Val Ser His Thr Ala
885 890 895

Gly Gln Ser Thr Val Trp Lys Ser Ser Ala Arg Ala Gly Glu Ala Trp
900 905 910

Ile Ser Gly Arg Gly Gly Asn Ile Tyr Glu Cys Thr Val Leu Ile Ser
915 920 925

Asp Gly Thr Arg Val Thr Thr Arg Lys Glu Arg Cys Leu Thr Asn Thr
930 935 940

Trp Ile Ala Val Glu Asn Gly Ala Ala Gln Ala Gln Leu Tyr Ser Leu
945 950 955 960

Phe Ser Gly Leu Val Ser Gly Leu Cys Gly Ser Ile Ser Ala Leu Tyr
965 970 975

Ala Thr Leu Trp Thr Ala Ile Tyr Phe
980 985



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

Met His Arg Pro His Leu Arg Arg His Ser Arg Tyr Tyr Ala Lys Gly 
1 5 10 15 

Glu Val Leu Asn Lys His Met Asp Cys Gly Gly Lys Arg Cys Cys Ser 
20 25 30 

Gly Ala Ala Val Phe Thr Leu Phe Trp Thr Cys Val Arg Ile Met Arg
35 40 45

Glu His Ile Cys Phe Val Arg Asn Ala Met Asp Arg His Leu Phe Leu
50 55 60

Arg Asn Ala Phe Trp Thr Ile Val Leu Leu Ser Ser Phe Ala Ser Gln
65 70 75 80

Ser Thr Ala Ala Val Thr Tyr Asp Tyr Ile Leu Gly Arg Arg Ala Leu
85 90 95

Asp Ala Leu Thr Ile Pro Ala Val Gly Pro Tyr Asn Arg Tyr Leu Thr
100 105 110

Arg Val Ser Arg Gly Cys Asp Val Val Glu Leu Asn Pro Ile Ser Asn
115 120 125

Val Asp Asp Met Ile Ser Ala Ala Lys Glu Lys Glu Lys Gly Gly Pro
130 135 140

Phe Glu Ala Ser Val Val Trp Phe Tyr Val Ile Lys Gly Asp Asp Gly
145 150 155 160

Glu Asp Lys Tyr Cys Pro Ile Tyr Arg Lys Glu Tyr Arg Glu Cys Gly
165 170 175

Asp Val Gln Leu Leu Ser Glu Cys Ala Val Gln Ser Ala Gln Met Trp
180 185 190

Ala Val Asp Tyr Val Pro Ser Thr Leu Val Ser Arg Asn Gly Ala Gly
195 200 205

Leu Thr Ile Phe Ser Pro Thr Ala Ala Leu Ser Gly Gln Tyr Leu Leu
210 215 220

Thr Leu Lys Ile Gly Arg Phe Ala Gln Thr Ala Leu Val Thr Leu Glu
225 230 235 240

Val Asn Asp Arg Cys Leu Lys Ile Gly Ser Gln Leu Asn Phe Leu Pro
245 250 255

Ser Lys Cys Trp Thr Thr Glu Gln Tyr Gln Thr Gly Phe Gln Gly Glu
260 265 270

His Leu Tyr Pro Ile Ala Asp Thr Asn Thr Arg His Ala Asp Asp Val
275 280 285

Tyr Arg Gly Tyr Glu Asp Ile Leu Gln Arg Trp Asn Asn Leu Leu Arg
290 295 300

Lys Lys Asn Pro Ser Ala Pro Asp Pro Arg Pro Asp Ser Val Pro Gln
305 310 315 320

Glu Ile Pro Ala Val Thr Lys Lys Ala Glu Gly Arg Thr Pro Asp Ala
325 330 335

Glu Ser Ser Glu Lys Lys Ala Pro Pro Glu Asp Ser Glu Asp Asp Met
340 345 350

Gln Ala Glu Ala Ser Gly Glu Asn Pro Ala Ala Leu Pro Glu Asp Asp
355 360 365

Glu Val Pro Glu Asp Thr Glu His Asp Asp Pro Asn Ser Asp Pro Asp
370 375 380

Tyr Tyr Asn Asp Met Pro Ala Val Ile Pro Val Glu Glu Thr Thr Lys
385 390 395 400

Ser Ser Asn Ala Val Ser Met Pro Ile Phe Ala Ala Phe Val Ala Cys
405 410 415

Ala Val Ala Leu Val Gly Leu Leu Val Trp Ser Ile Val Lys Cys Ala
420 425 430



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



Met Ala Ser Leu Leu Gly Thr Leu Ala Leu Leu Ala Ala Thr Leu Ala
1 5 10 15

Pro Phe Gly Ala Met Gly Ile Val Ile Thr Gly Asn His Val Ser Ala
20 25 30

Arg Ile Asp Asp Asp His Ile Val Ile Val Ala Pro Arg Pro Glu Ala
35 40 45

Thr Ile Gln Leu Gln Leu Phe Phe Met Pro Gly Gln Arg Pro His Lys
50 55 60

Pro Tyr Ser Gly Thr Val Arg Val Ala Phe Arg Ser Asp Ile Thr Asn
65 70 75 80

Gln Cys Tyr Gln Glu Leu Ser Glu Glu Arg Phe Glu Asn Cys Thr His
85 90 95

Arg Ser Ser Ser Val Phe Val Gly Cys Lys Val Thr Glu Tyr Thr Phe
100 105 110

Ser Ala Ser Asn Arg Leu Thr Gly Pro Pro His Pro Phe Lys Leu Thr
115 120 125

Ile Arg Asn Pro Arg Pro Asn Asp Ser Gly Met Phe Tyr Val Ile Val
130 135 140

Arg Leu Asp Asp Thr Lys Glu Pro Ile Asp Val Phe Ala Ile Gln Leu
145 150 155 160

Ser Val Tyr Gln Phe Ala Asn Thr Ala Ala Thr Arg Gly Leu Tyr Ser
165 170 175

Lys Ala Ser Cys Arg Thr Phe Gly Leu Pro Thr Val Gln Leu Glu Ala
180 185 190

Tyr Leu Arg Thr Glu Glu Ser Trp Arg Asn Trp Gln Ala Tyr Val Ala
195 200 205

Thr Glu Ala Thr Thr Thr Ser Ala Glu Ala Thr Thr Pro Thr Pro Val
210 215 220

Thr Ala Thr Ser Ala Ser Glu Leu Glu Ala Glu His Phe Thr Phe Pro
225 230 235 240

Trp Leu Glu Asn Gly Val Asp His Tyr Glu Pro Thr Pro Ala Asn Glu
245 250 255

Asn Ser Asn Val Thr Val Arg Leu Gly Thr Met Ser Pro Thr Leu Ile
260 265 270

Gly Val Thr Val Ala Ala Val Val Ser Ala Thr Ile Gly Leu Val Ile
275 280 285

Val Ile Ser Ile Val Thr Arg Asn Met Cys Thr Pro His Arg Lys Leu
290 295 300

Asp Thr Val Ser Gln Asp Asp Glu Glu Arg Ser Gln Thr Arg Arg Glu
305 310 315 320

Ser Arg Lys Phe Gly Pro Met Val Ala Cys Glu Ile Asn Lys Gly Ala
325 330 335

Asp Gln Asp Ser Glu Leu Val Glu Leu Val Ala Ile Val Asn Pro Ser
340 345 350

Ala Leu Ser Ser Pro Asp Ser Ile Lys Met .
355 360



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 




(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

Met Asn Met Leu Val Ile Val Leu Ala Ser Cys Leu Ala Arg Leu Thr
1 5 10 15

Phe Ala Thr Arg His Val Leu Phe Leu Glu Gly Thr Gln Ala Val Leu
20 25 30

Gly Glu Asp Asp Pro Arg Asn Val Pro Glu Gly Thr Val Ile Lys Trp
35 40 45

Thr Lys Val Leu Arg Asn Ala Cys Lys Met Lys Ala Ala Asp Val Cys
50 55 60

Ser Ser Pro Asn Tyr Cys Phe His Asp Leu Ile Tyr Asp Gly Gly Lys
65 70 75 80

Lys Asp Cys Pro Pro Ala Gly Pro Leu Ser Ala Asn Leu Val Ile Leu
85 90 95

Leu Lys Arg Gly Glu Ser Phe Val Val Leu Gly Ser Gly Leu His Asn
100 105 110

Ser Asn Ile Thr Asn Ile Met Trp Thr Glu Tyr Gly Gly Leu Leu Phe
115 120 125

Asp Pro Val Thr Arg Ser Asp Glu Gly Ile Tyr Phe Arg Arg Ile Ser
130 135 140

Gln Pro Asp Leu Ala Met Glu Thr Thr Ser Tyr Asn Val Ser Val Leu
145 150 155 160

Ser His Val Asp Glu Lys Ala Pro Ala Pro His Glu Val Glu Ile Asp
165 170 175

Thr Ile Lys Pro Ser Glu Ala His Ala His Val Glu Leu Gln Met Leu
180 185 190

Pro Phe His Glu Leu Asn Asp Asn Ser Pro Thr Tyr Val Thr Pro Val
195 200 205

Leu Arg Val Phe Pro Pro Thr Glu His Val Lys Phe Asn Val Thr Tyr
210 215 220

Ser Trp Tyr Gly Phe Asp Val Lys Glu Glu Cys Glu Glu Val Lys Leu
225 230 235 240

Phe Glu Pro Cys Val Tyr His Pro Thr Asp Gly Lys Cys Gln Phe Pro
245 250 255

Ala Thr Asn Gln Arg Cys Leu Ile Gly Ser Val Leu Met Ala Glu Phe
260 265 270

Leu Gly Ala Ala Ser Leu Leu Asp Cys Ser Arg Asp Thr Leu Glu Asp
275 280 285

Cys His Glu Asn Arg Val Pro Asn Leu Arg Phe Asp Ser Arg Leu Ser
290 295 300

Glu Ser Arg Ala Gly Leu Val Ile Ser Pro Leu Ile Ala Ile Pro Lys
305 310 315 320

Val Leu Ile Ile Val Val Ser Asp Gly Asp Ile Leu Gly Trp Ser Tyr
325 330 335

Thr Val Leu Gly Lys Arg Asn Ser Pro Arg Val Val Val Glu Thr His
340 345 350

Met Pro Ser Lys Val Pro Met Asn Lys Val Val Ile Gly Ser Pro Gly
355 360 365

Pro Met Asp Glu Thr Gly Asn Tyr Lys Met Tyr Phe Val Val Ala Gly
370 375 380

Val Ala Ala Thr Cys Val Ile Leu Thr Cys Ala Leu Leu Val Gly Lys
385 390 395 400

Lys Lys Cys Pro Ala His Gln Met Gly Thr Phe Ser Lys Thr Glu Pro
405 410 415

Leu Tyr Ala Pro Leu Pro Lys Asn Glu Phe Glu Ala Gly Gly Leu Thr
420 425 430

Asp Asp Glu Glu Val Ile Tyr Asp Glu Val Tyr Glu Pro Leu Phe Arg
435 440 445

Gly Tyr Cys Lys Gln Glu Phe Arg Glu Asp Val Asn Thr Phe Phe Gly
450 455 460

Ala Val Val Glu Gly Glu Arg Ala Leu Asn Phe Lys Ser Ala Ile Ala
465 470 475 480

Ser Met Ala Asp Arg Ile Leu Ala Asn Lys Ser Gly Arg Arg Asn Met
485 490 495

Asp Ser Tyr

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:

Met Pro Phe Lys Thr Arg Gly Ala Glu Asp Ala Ala Ala Gly Lys Asn
1 5 10 15

Arg Phe Lys Lys Ser Arg Asn Arg Glu Ile Leu Pro Thr Arg Leu Arg
20 25 30

Gly Thr Gly Lys Lys Thr Ala Gly Leu Ser Asn Tyr Thr Gln Pro Ile
35 40 45

Pro Trp Asn Pro Lys Phe Cys Ser Ala Arg Gly Glu Ser Asp Asn His
50 55 60

Ala Cys Lys Asp Thr Phe Tyr Arg Arg Thr Cys Cys Ala Ser Arg Ser
65 70 75 80

Thr Val Ser Ser Gln Pro Asp Ser Pro His Thr Pro Met Pro Thr Glu
85 90 95

Tyr Gly Arg Val Pro Ser Ala Lys Arg Lys Lys Leu Ser Ser Ser Asp
100 105 110

Cys Glu Gly Ala His Gln Pro Leu Val Ser Cys Lys Leu Pro Asp Ser
115 120 125

Gln Ala Ala Pro Ala Arg Thr Tyr Ser Ser Ala Gln Arg Tyr Thr Val
130 135 140

Asp Glu Val Ser Ser Pro Thr Pro Pro Gly Val Asp Ala Val Ala Asp
145 150 155 160

Leu Glu Thr Arg Ala Glu Leu Pro Gly Ala Thr Thr Glu Gln Thr Glu
165 170 175

Ser Lys Asn Lys Leu Pro Asn Gln Gln Ser Arg Leu Lys Pro Lys Pro
180 185 190

Thr Asn Glu His Val Gly Gly Glu Arg Cys Pro Ser Glu Gly Thr Val
195 200 205

Glu Ala Pro Ser Leu Gly Ile Leu Ser Arg Val Gly Ala Ala Ile Ala
210 215 220

Asn Glu Leu Ala Arg Met Arg Arg Ala Cys Leu Pro Leu Ala Ala Ser
225 230 235 240

Ala Ala Ala Ala Gly Ile Val Ala Trp Ala Ala Ala Arg Ala Leu Gln
245 250 255



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Ser Lys Cys Tyr Cys Leu Ala Arg His Leu Tyr Lys Ser Pro Arg 
1 5 10 15 

Cys Val Gly Arg Arg Val Ala Phe Gly Gly Leu Ala Thr Met Ser Arg 
20 25 30 

Pro Pro Thr Ser His Leu Asp Leu Ala Phe Ser Ala Ala Phe Arg Gly 
35 40 45 

Thr Asp Leu Pro Gly Gly Arg Phe Trp Arg Ala Ser Gln Ser Cys Asp
50 55 60

Ile Phe Phe Trp Pro Asp Leu Ala Ala Val Ile Val Gln Ala Ala Arg
65 70 75 80

Ala Tyr Phe Glu Gly Lys Glu Arg Leu Gly Ser Leu Gln Val Ala Glu
85 90 95

Asp Ile Thr Ala His Asp Pro Arg Ile Ala Pro Ala Ala Lys Arg Ala
100 105 110

Val Ala Ala Ala Val Gly Leu Trp Thr Ala Leu Ser Glu Leu Val Gly
115 120 125

Gly Pro Asn Gly Glu Leu Glu Ser Lys Val Trp Gly Lys Gln Ile Pro
130 135 140

Arg Ala Ala Ala Trp Glu Ile Arg Asp Val Pro Lys Val Pro Val Ile
145 150 155 160

Gly Pro Asp Ile Leu Ser Phe Phe Ser Ala Ala Val Glu Leu Pro Val
165 170 175

Leu Tyr Ile Arg Ala Arg Gly Gly Ala His Ser Arg Ser Ala His Trp
180 185 190

Asn Asn Gln Ser Ser Ala Pro Ala Ala Gly Leu Ala Ala Ile Arg Ile
195 200 205

Gly Met Glu Met Val Arg Ser Leu Leu Val Ile Ala Leu Pro Leu Ser
210 215 220

Asn Phe Thr Leu Pro Glu Asp Leu Pro Glu Gly Ser Gln Asn Ser Ile
225 230 235 240

Arg Ala Phe Val Ala His Leu Met Asn Cys Val Ala Thr Asp Lys Ile
245 250 255

Met Ser Pro Asp Val Arg Val Pro Val Glu Glu Ser Phe Tyr Ser His
260 265 270

Cys Leu Arg Glu Ile Ile Met Cys Glu Arg Ala Phe Cys Tyr Pro Cys
275 280 285

Asn Pro Pro Pro Lys Trp
290



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Glu Asn Met Leu Asp Gly Cys Tyr Pro Leu Ala Leu Met Asp Ser 
1 5 10 15 

Asp His Ile Thr Ala His Ala Val Pro Arg Gly Glu Arg Arg Arg Gln
20 25 30

Gly Ala Ala Val Ala Ser Ser Glu Ser Ala Asp Ser Val Asp Pro Cys
35 40 45

Ile Arg Ile Ala Ser Arg Leu Trp Arg Glu Leu Val Glu Ile Ser Ser
50 55 60

Glu Leu Lys Asp Gly Tyr Gly Glu Phe Thr Ser Ala Arg Asp Arg Arg
65 70 75 80

Asn Ala Leu Ile Ala Ala Asn Glu Arg Leu Arg Ser Ala Phe Leu Gly
85 90 95

Ala Ser Arg Ala Thr Arg Gly Leu Gly Leu Arg Pro Arg Trp Ala Ser
100 105 110

Thr Glu Ser Val Ala Asn Ser Pro Thr Asp Pro Asn Asn Gly Asn Gly
115 120 125

Leu Gly Glu Leu Glu Glu Ala Met Glu Gly Ile Glu Gly Asp Phe Trp
130 135 140

Leu Asp Ser Leu Asp Gly Asp Arg Phe Glu Asp Glu Ser Arg Thr Met
145 150 155 160

Gln Ser Glu Asn Met Arg Phe Val Ile Glu Lys Glu Leu Leu Ser Trp
165 170 175

Leu Ser Arg His Leu Pro Ala Asp Leu Ala Ser Ala Glu Arg Glu Thr
180 185 190

Ser Arg Ser Leu Leu Ala Ala Gly His Trp Cys Cys Leu Trp His Pro
195 200 205

Arg Pro Cys Arg Glu Ala Cys Leu Tyr Asp Ser Ile Tyr Val Gln Ser
210 215 220

Leu Phe Cys Val Gly Thr Gly Arg Val Pro Gln Ser Glu Met Arg Arg
225 230 235 240

Arg Glu Tyr Leu Ala Ala Leu Arg Ala Gly Ala Ala Ala Ala Asn Ser
245 250 255

Pro Glu Val Ser Ala Ser Ile Phe Ala Arg Asp Ala Gly Ile Ala Leu
260 265 270

Ala Leu Ala Arg Arg Arg .
275



